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of Hucleie '^"^'^ Amplification 

, inter alia, to the amplification of nucleic 

This invention relates, inter aiia. 

acids. 

Molecular biology and pharmaceutical drug development now make 
Molecular bxo J (Friedrich, G.A. Moving beyond 

intensive use of nucleic a i ,.10,4 (1996)) The most 

the genome projects, *.ature Biotechnology 14, 1234 ^5^«> 

chalLnging areas are whole genome sequencing, single nucleotide 
challenging ^^^^ expression monitoring. 

^°^^°:f ":;:rh;nrr as of thousands of samples are handled in 
currently, up to hundrea ^ ^^^^^ ^ 

single DNA sequencing projects (Venter, J'^-' H-° 
new strategy for genome sequencing. Nature 381, 364 (1996)). This 
c apac ty i li^tL by the available technology. Projects like the 
capacity sequencing) and 

:r::frinrarprymor:Lms m e^ressed genes involved in common 
ZeJs imply the sequencing of millions of DNA samples. 

• . . .f the current DNA sequencing technologies, it is simply 
t pTs i. o - ease indefinitely the time required to process a 
r sa^ole A way of increasing throughput is to perform many 
single sample. A way .^^ ..^.otic sample 

p^ocesses -/J^ J /^^^ 33, ..n plates, high density 
preparation ^« -^Z/' ^ ^.k. Ahmadi, J. Curtis, 

rre:;ac::riic^^^^^^^^ — - -^^^r.— 

Biotechnology 35,191 (1994)) ^^^^^^ ^_ 

aensity oligonucleotide -rays C e ^^'^-^^......S. Morris, and 

BerncX.C. Huang,D. Stern,J. Winkler, hiah-density DNA 

S p A Fodor, Accessing genetic information with high density 
a^ravi Scie ce 274 (5287) :610-614, (1996)) are starting to bring 
we 's .e demand in ever higher throughput. Such technologies 
low up to 50,000-100,000 samples at a time to be processed within 
ts an: even hours (Maier, Kobotic ^^^^^^^^ 
screening. Laboratory Robotics and Automation 7, 123 (1995)). 



1 



PCT/GB98/00961 

WO 98/44151 



in .est known methods for perfonaing nucleic acid analysxs xt 
1 cessary to first extract t.e nucleic acids of xnterest (e.g., 
:.ic or .itocHondrial or messenger KH. > ^ 

^,n, Then it is necessary to isolate the nucleic acxds of 
organism. ^^^^^ ..^^s and usually, to 

interest fro. ^^^^HH ^^,„,,,,3S suitable for their 

:raira:iran:;: -ection. xsolatm. the nucleic fragments 
has been considered necessary even when one is interested in a 
"re entatLe but rando. set of all of the different nucleic acids 

instance a representative set of all the .RHAs present in a cell 
for instance, rep ^NA has been cut 

or of all the fragments obtained aicer ye 
randomly into small pieces. 

1 ...hods can be used to amplify DNA with biological means and 
" by hose sKilled in the art. Generally, the fragments 
are well ^^°^ J ...^ors with the use of restriction 

of OKA - , ,.,,,ining a fragment of interest 

^"Tent ntr rc Anto a biological host and amplified by means 
:rwe 11 e a Ushed protocols. Usually hosts are randomly spread over 

J wth medium (e.g. agar plates). They can then replicate .o 
p/ovide colonies that originated from individual host cells. 

n to millions of simultaneous amplification of cloned DNA fragments 
' I carried out simultaneously in such hosts. The density of 
can be carried out s ,,,er to obtain DNA from 

rr:::o::e:^nro::ion is to transfer the colonies to a membrane, 

. • „,ohilise the DNA from within the biological hosts 

and then to immobiiise tne uwn 

,,7961 11975),. With t.ese options ho..v«, the ..ount " 
ZZLTl 1. u.uea ..a o«,n l.,u„lcl,« .o. „o„-.a.o,ct.,. 

detection. 

. to twn^tst by t«,ch>.iqv.e Individually each 

A„otn« option i= to t«n=£« y ^^^^^^^ ^^^^ 

.oiony into ' ™ ^ l;', o„ P, oPt.in.d «to. tpa 
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cells with an appropriate purification process. However such a 
procedure is generally time and labour consuming, and difficult to 

automate . 

The revolutionary technique of DMA amplification using the polymerase 
chain reaction (PGR) was proposed in 1985 by Mullis et al. (Saiki, 
R , S. Scharf, r. Faloona, K. Mullis, G. Horn, H. Erlich and N. 
Arnheim, Science 230, 1350-1354 (1985) and is now well Icnown by those 
skilled in the art. In this amplification process, a DNA fragment of 
interest can be amplified using two short (typically about 20 base 
long) oligonucleotides that flank a region to be amplified, and that 
are usually referred to as "primers". Amplification occurs during 
the PGR cycling, which includes a step during which double stranded 
DNA molecules are denatured (typically a reaction mix is Seated e.g. 
to 95'G in order to separate double stranded DNA molecules into two 
Single stranded fragments), an annealing step (where the reaction mix 
is brought to e.g., 45-C in order to allow the primers to anneal to 
the single stranded templates) and an elongation step (DNA 
complementary to the single stranded fragment is synthesised via 
sequential nucleotide incorporation at the ends of the primers with 
the DNA polymerase enzyme) . 

The above procedure is usually performed in solution, whereby neither 
the primers nor a template are linked to any solid matrix. 

More recently, however, it has been proposed to use one primer 
grafted to a surface in conjunction with free primers in solution m 
order to simultaneously amplify and graft a PGR product onto the 
surface (Oroskar, A.A., S.E. Rasmussen, H.N. Rasmussen. S.R. 
Rasmussen,B.M. Sullivan, and A. Johansson, Detection of immobilised 
amplicons by ELISA-like techniques. Clinical Chemistry 42:1547 
(1996)). (The term "graft" is used herein to indicate that a moiety 
becomes attached to a surface and remains there unless and until it 
is desired to remove it.)The amplification is generally performed in 
containers (e.g., in 96 well format plates, in such a way that each 
container contains the PGR product (s) of one reaction. With such 
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methods, some of the PCR product become grafted to a surface of the 
container having primers therein which has been in contact with the 
reactant during the PCR cycling. The grafting to the surface 
simplifies subsequent assays and allows efficient automation. 

Arraying of DNA samples is more classically performed on membranes 
(e.g., nylon or nitro-cellulose membranes). With the use of suitable 
robotics (e.g., Q-bot- , Genetix ltd, Dorset BH23 3TG OK) it is 
possible to reach a density of up to 10 samples W. Here, the DNA is 
covalently linked to a membrane by physicochemical means (e.g., OV 
irradiation) . These technologies allow the arraying of large DNA 
molecules (e.g. molecules over 100 nucleotides long) as well as 
smaller DNA molecules. Thus both templates and probes can be 
arrayed. 

New approaches based on pre-arrayed glass slides (arrays of reactive 
areas obtained by ink-jet technology (Blanchard, A. P. and L. Hood, 
Oligonucleotide array synthesis using ink jets. Microbial and 
comparative Genomics, 1:225 (1996)) or arrays of reactive 
polyacrylamide gels (Yershov, G. et al., DNA analysis and diagnostics 
on oligonucleotide microchips, Proceedings of the National Academy of 
Science^ USA. 93:4913-4918 (1996)) allow the arraying of up to 100 
samples W. With these technologies, only probe (oligonucleotide) 
grafting has been reported. Reported number of samples/mm^ are still 
fairly low (25 to 64) . 

Higher sample densities are achievable by the use of DNA chips, which 
can be arrays of oligonucleotides covalently bound to a surface and 
can be obtained with the use of micro-lithographic techniques (Fodor, 
S.P.A. et al., Light directed, spatially addressable parallel 
chemical synthesis. Science 251:767 (1991) ) . Currently, chips with 
625 probes/mm^ are used in applications for molecular biology 
(Lockhart, D.J. et al., Expression monitoring by hybridisation to 
high-density oligonucleotide arrays, Nature Biotechnology 14:1675 
(1996) ) . Probe densities of up to 250 000 samples/cm^ are claimed to 
be achievable (Ghee, M. et al.. Accessing genetic information with 
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.«.v.. : 

direct ,oUd P-"" ,,ip, have been used to 

::::;e"r;rJtir::::= - - - • - 

polymerase-mediated elongation step. 

i^nkPd to the vessel in which PCR amplification 
„„.„ Pca Ptodu. = 2 " 111 a= e a.„ct .„.Vi« P-cess. - 
takes pl.-, th.= oaa be ^^^^ 

rrtetTet. .o».t. ,.e.. .Uo. P».V — 
PCR products/™' of eurt.ce to be obtained. 

,,.i.llv available Hucleolink- syate. obtainable fro. 
„.i„, the co-.erci.Uy avail ^^^^^^ ei.nlt.neo«e 

„„„c / .^r. in containers on the surface of 

:::r:ri::ri::trd: 'prir. ..e bee. ..ft.^ - ^ 

e.s. the --;';:rt:^ro:rs'.;i:sW uUievabl. for the 
vessel- presently a density o .^^ difficult. This is 

„ .ii Plate --^ J— riiriit 3,. .... pi.«= 

apparent J« ^^^^^^ J p,, nas t.en delayed due to tecbnic.l 

,0.08 -P'""-;^;";;rs;er .nd cpiUary effects durln, fiUin,, . 
problem (e.,. heat «»»»^ i^rove„e„ts in the 

LritH: Cl """U - — - -""'^ 

foreseeable future. 
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A. providing a plurality of primers that are immobilised but that 

have one end exposed to allow primer extension; 
B allowing a single stranded target nucleic acid molecule to 
anneal to one of said plurality of primers over part of the 
length of said single stranded nucleic acid molecule and then 
extending that primer using the annealed single stranded 
nucleic acid molecule as a template, so as to provide an 
extended immobilised nucleic acid strand; 
C. separating the target nucleic molecule from the extended 

immobilised nucleic acid strand; 
D allowing the extended immobilised nucleic acid strand to anneal 
to one of said plurality of primers referred to in step A) and 
then extending that primer using the extended immobilised 
nucleic acid strand as a template, so as to provide another 
extended immobilised nucleic acid strand; and optionally, 
E. separating the annealed extended immobilised nucleic acid 
strands from one another. 

Preferably the method also comprises the step of: 

F using at least one extended immobilised nucleic acid strand to 
repeat steps D) and E) , so as to provide additional extended 
immobilised nucleic acid strands and, optionally, 

G. repeating step F) one or more times. 

Desirably the single-stranded target nucleic acid sequence is 
provided by a method in which said single-stranded target nucleic 
acid is produced by providing a given nucleic acid sequence to be 
amplified (which sequence may be known or unknown, and adding thereto 
a first nucleic acid sequence and a second nucleic acid sequence; 
wherein said first nucleic acid sequence hybridises to one of said 
plurality of primers and said second nucleic acid sequence is 
complementary to a sequence which hybridises to one of said plurality 
of primers- 

The second nucleic acid sequence may be a sequence that is the same 
as the sequence of one of the plurality of primers. Thus the single- 
stranded target nucleic acid squence may be provided by a method in 
«hich said single-stranded target nucleic acid is produced by 
providing a given nucleic acid sequence to be amplified (which 
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sequence may be known or unknown) and adding thereto a first nucleic 
acid sequence and a second nucleic acid sequence; wherein said first 
nucleic acid sequence hybridises to one of said plurality of primers 
and said second nucleic acid sequence is the same as the sequence of 
one of said plurality of primers. 

The first and second nucleic acid sequences may be provided at first 
and second ends of said single-stranded target nucleic acid, although 
this is not essential. 

If desired a tag may be provided to enable amplification products of 
a given nucleic acid sequence to be identified. 

Colonies 

The method of the present invention allows one or more distinct areas 
to be provided, each distinct area comprising a plurality of 
immobilised nucleic acid strands (hereafter called "colonies") . 
These areas can contain large numbers of amplified nucleic acid 
molecules. These molecules may be DNA and/or RNA molecules and may 
be provided in single or double stranded form. Both a given strand 
and its complementary strand can be provided in amplified form in a 
single colony. 

Colonies of any particular size can be provided. 

However, preferred colonies measure from lOnm to 100pm across their 
longest dimension, more preferably from lOOnm to 10pm across their 
longest dimension. Desirably a majority of the colonies present on a 
surface (i.e. at least 50% thereof) have sizes within the ranges 
given above. 

Colonies can be arranged in a predetermined manner or can be randomly 
arranged. Two or three dimensional colony configurations are 
possible. The configurations may be regular {e.g. having a polygonal 
outline or having a generally circular outline) or they may be 
irregular. 
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Colonies can be provided at high densities. Densities of over one 
colony/nun^ of surface can be achieved. Indeed densities of over lOS 
over 10^ or even over 10* colonies /mm^ are achievable using the 
present invention. In preferred embodiments, the present invention 
provides colony densities of 10**^ colonies/mm^, more preferably 
densities of lO^""^ colonies/mm^ thus offering an improvement of 3 to 4 
orders of magnitude relative to densities achievable using many of 
the prior art methods. It is this property of the invention that 
allows a great advantage over prior art, since the high density of 
DNA colonies allows a large number of diverse DNA templates (up to 
10^'' colonies /mm^, to be randomly arrayed and amplified. 

Primers 

The immobilised primers for use in the present invention can be 
provided by any suitable means, as long as a free 3' -OH end is 
available for primer extension. Where many different nucleic acid 
molecules are to be amplified, many different primers may be 
provided. Alternatively "universal" primers may be used, whereby 
only one or two different types of primer {depending upon the 
embodiment of the invention) can be used to amplify the different 
nucleic acid molecules . Universal primers can be used where the 
molecules to be amplified comprise first and second sequences, as 
described previously. The provision of universal primers is 
advantageous over methods such as those disclosed in WO96/04404 
(Mosaic Technologies, Inc.) where specific primers must be prepared 
for each particular sequence to be amplified. 

Synthetic oligodeoxynucleotide primers are available commercially 
from many suppliers (e.g. Microsynth, Switzerland, Eurogentech, 
Belgium) . 

Grafting of primers onto silanized glass or quartz and grafting of 
primers onto silicon wafers or gold surface has been described 
(Maskos, U. and E.M Southern, Oligonucleotide hybridizations on glas 
supports: a novel linker for oligonucleotide synthesis and 
hybridization properties of oligonucleotides synthesised in situ, 
Nucleic Acids Research 20 (7 ): 1679-84 , 1992; Lamture, J.B., et.al. 
Direct-detection of nucleic-acid hybridization on the surface of a 
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charge-coupied-device. Nucleic Acids Research 22 (11) : 2121-2125, 1994; 
Chrisey, L.A., G.U. Lee, and C.E. Oferrall, Covalent attachment of 
synthetic DNA to self-assembled monolayer films. Nucleic Acids 
Research 24 ( 15 ): 3031-3039, 1996). 

Grafting biotinylated primers to supports covered with streptavidin 
is another alternative. This grafting method is commonly used for 
bio-macromoiecules in general. 

Non covalent grafting of primers at the interface between an aqueous 
phase and a hydrophobic phase through an hydrophobic anchor is also 
possible for the present invention. Such anchoring is commonly used 
for bio-macromolecules in general (S. Terrettaz et al. : Protein 
binding to supported lipid membranes, Langmuir 9,1361 (1993)). 
Preferred forms of such interfaces would be liposomes, lipidic 
vesicles, emulsions, patterned bilayers, Langmuir or Langmuir- 
Blodgett films. The patterns may be obtained by directed pattering on 
templates, e.g., silicon chips patterned through micro-lithographic 
methods (Goves, J.T. et al., Micropatterning Fluid Bilayers on Solid 
Supports, in Science 275,651 (1997)). The patterns may also be 
obtained by due to the self-assembly properties of "colloids", e.g., 
emulsions or latex particles (Larsen, A.E. and D.G. Grier, Like 
charge attractions in metastable colloidal crystallites, Nature 
385,230 (1997) ) . 

In the above methods, one, two or more different primers can be 
grafted onto a surface. The primers can be grafted homogeneously and 
simultaneously over the surface. 

Using microlithographic methods it is possible to provide immobilised 
primers in a controlled manner. If direct synthesis of 
oligonucleotides onto a solid support with a free 3' -OH end is 
desired, then micro-lithographic methods can be used to 
simultaneously synthesise many different oligonucleotide primers 
(Pirrung, M.C. and Bradley, J.C. Comparison of methods for 
photochemical phosphoramidite-based DNA- synthesis. Journal Of 
Organic Chemistry 60 (20) : 6270-6276, 1995). These may be provided in 
distinct areas that may correspond in configuration to colonies to be 
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formed, (e.g. they may be several nanometers or micrometers across). 
Within each area, only a single type of primer oligonucleotide need 
be provided. Alternatively a mixture comprising a plurality of 
different primers may be provided. In either case, primers can be 
homogeneously distributed within each area. They may be. provided in 
the form of a regular array. 

Where areas initially comprise only one type of immobilised primer 
they may be modified, if desired, to carry two or more different 
types of primer. One way to achieve this is to use molecules as 
templates for primer extension that have 3' ends that hybridise with 
a single type of primer initially present and that have 5' ends 
extending beyond the 3' ends of said primers. By providing a mixture 
of templates with different sequences from one another, primer 
extension of one type of primer using the mixture of such templates 
followed by strand separation will result in different modified 
primers. (The modified primers are referred to herein as "extended" 
primers in order to distinguish from the ''primary" primers initially 
present on a surface) . 

One, two or more different types of extended primer can be provided 
in this manner at any area where primary primers are initially 
located. Substantially equal portions of different templates can be 
used, if desired, in order to provide substantially equal proportions 
of different types of immobilised extended primer over a given area. 
If different proportions of different immobilised extended primers 
are desired, then this can be achieved by adjusting the proportions 
of different template molecules initially used accordingly. 

A restriction endonuclease cleavage site may be located within the 
primer. A primer may also be provided with a restriction 
endonuclease recognition site which directs DNA cleavage several 
bases distant (Type II restriction endonucleases) , (For the avoidance 
of doubt, such sites are deemed to be present even if the primer and 
its complement are required to be present in a double stranded 
molecule for recognition and/or cleavage to occur.) Alternatively a 
cleavage site and/or a recognition site may be produced when a primer 
is extended. In any event, restriction endonucleases can be useful 
in allowing an immobilised nucleic acid molecule within a colony to 
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be cleaved so as to release at least a part thereof. As an 
alternative to using other restriction endonucleases, ribozymes can 
be used to release at least parts of nucleic acid molecules from a 
surface (when such molecules are RNA molecules)* Other methods are 
possible. For example if a covalent bond is used to link a primer to 
a surface this bond may be broken (e.g. by chemical, physical or 
enzymatic means) . 

Primers for use in the present invention are preferably at least five 
bases long. Normally they will be less than 100 or less than 50 
bases long. However this is not essential. Naturally occurring 
and/or non-naturally occurring bases may be present in the primers. 

Target HucIqIc Acid Molacules 

Turning now to target nucleic acid molecules (also referred to herein 
as templates") for use in the method of the present invention, these 
can be provided by any appropriate means. A target molecule (when in 
single-stranded form) comprises a first part having a sequence that 
can anneal with a first primer and a second part having a sequence 
complementary to a sequence that can anneal with a second primer. In 
a preferred embodiment the second part has the same sequence as the 
second primer. 

The second primer may have a sequence that is the same as, or 
different from, the sequence of the first primer. 

The first and second parts of the target nucleic acid molecules are 
preferably located at the 3' and at the 5' ends respectively thereof. 
However this is not essential. The target molecule will usually also 
comprise a third part located between the first and second parts. 
This part of the molecule comprises a particular sequence to be 
replicated. It can be from any desired source and may have a known 
or unknown (sometimes referred to as "anonymous") sequence. It may 
be derived from random fractionation by mechanical means or by 
limited restriction enzyme digestion of a nucleic acid sample, for 
example , 
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Further parts of the target molecules may be provided if desired. 
For example parts designed to act as tags may be provided. A "tag" 
is defined by its function of enabling a particular nucleic acid 
molecule (or its complement) to be identified. 

Whatever parts are present, target nucleic acid molecules can be 
provided by techniques known to those skilled in the art of nucleic 
acid manipulation. For example, two or more parts can be joined 
together by ligation. If necessary, prior to ligation appropriate 
modifications can be made to provide molecules in a form ready for 
ligation. For example if blunt end ligation is desired then a 
single-strand specific exonuclease such as SI nuclease could be used 
to remove single stranded portions of molecules prior to ligation. 
Linkers and/or adapters may also be used in nucleic acid 
manipulation. (Techniques useful for nucleic acid manipulation are 
disclosed in Sambrook et ai, Molecular Cloning, 2"** Edition, Cold 
Spring Harbor Laboratory Press (1989), for example.) 

Once a template molecule has been synthesised it can be cloned into a 
vector and can be amplified in a suitable host before being used in 
the present invention. It may alternatively be amplified by PCR. As 
a further alternative, batches of template molecules can be 
synthesised chemically using automated DNA synthesisers (e.g. from 
Perkin-Elmer/Applied Biosystems, Foster City, CA) . 

It is however important to note that the present invention allows 
large numbers of nucleic acid molecules identical in sequence to be 
provided in a colony arising from a single molecule of template. 
Furthermore, the template can be re-used to generate further 
colonies. Thus it is not essential to provide large numbers of 
template molecules to be used in colony formation. 

The template can be of any desired length provided that it can 
participate in the method of the present invention. Preferably it is 
at least 10, more preferably at least 20 bases long. More preferably 
it is at least 100 or at least 1000 bases long. As is the case for 
primers for use in the present invention, templates may comprise 
naturally occurring and/or non-naturally occurring bases. 
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Reaction Conditions 

Turning now to reaction conditions suitable for the method of the 
present invention, it will be appreciated that the present invention 
uses repeated steps of annealing of primers to templates, primer 
extension and separation of extended primers from templates. These 
steps can generally be performed using reagents and conditions known 
to those skilled in PCR (or reverse transcriptase plus PCR) 
techniques. PCR techniques are disclosed, for example, in "PCR: 
Clinical Diagnostics and Research", published in 1992 by Springer- 
Verlag. 

Thus a nucleic acid polymerase can be used together with a supply of 
nucleoside triphosphate molecules (or other molecules that function 
as precursors of nucleotides present in DNA/RNA, such as modified 
nucleoside triphosphates) to extend primers in the presence of a 
suitable template. 

Excess deoxyribonucleoside triphosphates are desirably provided. 
Preferred deoxyribonucleoside triphosphates are abbreviated; dTTP 
(deoxythymidine nucleoside triphosphate) , dATP (deoxyadenosine 
nucleoside triphosphate), dCTP (deoxycytosine nucleoside 
triphosphate) and dGTP (deoxyguanosine nucleoside triphosphate) . 
Preferred ribonucleoside triphosphates are OTP, ATP, CTP and GTP. 
However alternatives are possible. These may be naturally or non- 
naturally occurring. A buffer of the type generally used in PCR 
reactions may also be provided. 

A nucleic acid polymerase used to incorporate nucleotides during 
primer extension is preferably stable under the pertaining reaction 
conditions in order that it can be used several times. (This is 
particularly useful in automated amplification procedures.) Thus, 
where heating is used to separate a newly synthesised nucleic acid 
strand from its template, the nucleic acid polymerase is preferably 
heat stable at the temperature used. Such heat stable polymerases 
are known to those skilled in the art. They are obtainable from 
thermophilic micro-organisms. They include the DNA dependent DNA 
polymerase known as Tag polymerase and also thermostable derivatives 
thereof. (The nucleic acid polymerase need not however be DNA 



13 



SUBSTITUTE SHEET (RULE 26) 



wo 98/44151 



PCT/GB98/0096i 



dependent. It may be RNA dependent. Thus it may be a reverse 
transcriptase - i.e. an RNA dependent DNA polymerase,) 

Typically, annealing of a primer to its template takes place at a 
temperature of 25 to 90*C. Such a temperature range will normally be 
maintained during primer extension. Once sufficient time has elapsed 
to allow annealing and also to allow a desired degree of priaer 
extension to occur, the temperature can be increased, if desired, to 
allow strand separation. At this stage the temperature will 
typically be increased to a temperature of 60 to lOO^C. [High 
temperatures can also be used to reduce non-specific priming problems 
prior to annealing. They can be used to control the timing of colony 
initiation, e.g. in order to synchronise colony initiation for a 
number of samples.] Alternatively, the strands maybe separated by 
treatment with a solution of low salt and high pH (> 12) or by using 
a chaotropic salt (e.g. guanidinium hydrochloride) or by an organic 
solvent (e.g. formamide) . 

Following strand separation (e.g. by heating), preferably a washing 
step will be performed. The washing step can be omitted between 
initial rounds of annealing, primer extension and strand separation, 
if it is desired to maintain the same templates in the vicinity of 
immobilised primers. This allows templates to be used several times 
to initiate colony formation. (It is preferable to provide a high 
concentration of template molecules initially so that many colonies 
are initiated at one stage.) 

The size of colonies can be controlled, e.g. by controlling the 
number of cycles of annealing, primer extension and strand separation 
that occur. Other factors which affect the size of colonies can also 
be controlled. These include the number and arrangement on a surface 
of immobilised primers, the conformation of a support onto which the 
primers are immobilised, the length and stiffness of template and/or 
primer molecules, temperature and the ionic strength and viscosity of 
a fluid in which the above-mentioned cycles can be performed. 
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Uses of Colonies 

Once colonies have been formed they can be used for any desired 
purpose . 

For example, they may be used in nucleic acid sequencing (whether 
partial or full), in diagnosis, in screening, as supports for other 
components and/or for research purposes (preferred uses will be 
described in greater detail later on) . If desired colonies may be 
modified to provide different colonies (referred to herein as 
"'secondary colonies" in order to distinguish from the "primary 
colonies" initially formed) . 

Surfaces Comprising^ laaobilised Nucleic Acid Strands 

A surface comprising immobilised nucleic acid strands in the form of 
colonies of single stranded nucleic acid molecules is also within the 
scope of the present invention. 

Normally each immobilised nucleic acid strand within a colony will be 
located on the surface so that an immobilised and complementary 
nucleic acid strand thereto is located on the surface within a 
distance of the length of said immobilised nucleic acid strand (i.e. 
within the length of one molecule) . This allows very high densities 
of nucleic acid strands and their complements to be provided in 
immobilised form. Preferably there will be substantially equal 
proportions of a given nucleic acid strand and its complement within 
a colony. A nucleic acid strand and its complement will preferably 
be substantially homogeneously distributed within the colony. 

It is also possible to provide a surface comprising single stranded 
nucleic acid strands in the form of colonies, where in each colony, 
the sense and anti-sense single strands are provided in a form such 
that the two strands are no longer at all complementary, or simply 
partially complementary. Such surfaces are also within the scope of 
the present invention. Normally, such surfaces are obtained after 
treating primary colonies, e.g., by partial digestion by restriction 
enzymes or by partial digestion after strand separation (e.g., after 
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heating) by an enzyme which digests single stranded DNA) , or by 
chemical or physical means, (e.g., by irradiating with light colonies 
which have been stained by an intercalating dye e.g., ethidiura 



bromide ) 



Once single stranded colonies have been provided they can be used to 
provide double stranded molecules. This can be done, for example, by 
providing a suitable primer (preferably in solution) that hybridises 
to the 3' ends of single stranded immobilised molecules and then 
extending that primer using a nucleic acid polymerase and a supply of 
nucleoside triphosphates (or other nucleotide precursors) . 

Thus surfaces comprising colonies of non-bridged double stranded 
nucleic acid molecules are also within the scope of the present 
invention. (The term "non-bridged" is used here to indicate that the 
molecules are not in the form of the bridge-lilce structures shown in 
e.g. figure Ih.) 

Using the present invention, small colonies can be provided that 
contain large numbers of nucleic acid molecules (whether single or 
double stranded) . Many colonies can therefore be located on a 
surface having a small area. Colony densities that can be obtained 
may therefore be very high, as discussed supra. 

Different colonies will generally be comprised of different amplified 
nucleic acid strands and amplified complementary strands thereto. 
Thus the present invention allows many different populations of 
amplified nucleic acid molecules and their complements to be located 
on._a single surface having a relatively small surface area. The 
surface will usually be planar, although this is not essential. 

Apparatuses 



The present invention also provides an apparatus for providing a 
surface comprising colonies of the immobilised nucleic acid molecules 
discussed supra. 

Such an apparatus can include one or more of the following: 
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„ „.a„s for «.<,.lUsir„ p.i.ers o„ , surf.ce ,=U.ou,h this i. 
not needed if i-»biii=ed pri.e.s 3r. .l«.dy provided, , 



b) 
c) 



supply of a nucleic acid polymerase; 



of the nucleotides to be incorporated 



d) 



e) 



a supply of precursors 
into a nucleic acid (e.g. a supply of nucleos.de 

triphosphates) ; 

.eans for separating annealed nucleic acids (e.g. heating 

means) ; 

and 

control for co-ordi„tin, t.. dl,f«,nt step, retired 

for th. «.thod of th. present indention. 



Otner .pparetuses .re within tne scop. '^;~f tne 

p„s.nt invention to P. .n.lvs.d ™.y - ^ 

^ ^T^H riPtectinq means for decectiny a ^ ^ 

" - — rr :::: zr^zz. ... . 

colonies, as described supra. 

w .for detecting a signal has sufficient resolution 

:r:r. rtrd::t"petln si... ,ener.t.d fro. different 

colonies . 

^apparatuses of the present invention (of whatever nature) are 



drawings . 
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:t shoul. be appreciated that p.oceau.es usin. -A ™oIecules referred 
to in these sections are applicable .utatis .uta.d.s to RNA 
molecules, unless the context indicates otherwise. 

It Should also be appreciated that where sequences are Pr-i<^ed in 
e following description, these are written fro^ to 3' (.o.n, 
fro. left to right,, unless the context indicates otherw.se. 

The figures provided are summarised below. 

FIGURE 1 illustrates a method for the simultaneous 
amplification and immobilisation of nucleic acid molecules 
using a single type of primer. 

FIGURE 2 illustrates how colony growth using a method of the 
present invention can occur. 

FIGURE 3 illustrates the principle of the method used to 
produce DNA colonies using the present invention. It also 
illustrates the annealing, elongation and denaturxng steps that 
are used to provide such colonies. 

FIGURE 4 is an example of DNA colonies formed by amplification 
of a specific template with single primers grafted onto a 

surface. 

FIGURE 5 illustrates a method for the simultaneous 
, amplification and i,«aobilisation of nucleic acid molecules 
using two types of primer. 

FIGURE 6 shows actual D«A colonies produced via the present 
invention. 

FIGURE 7 illustrates a method of the simultaneous amplification 
and i^obilisation of nucleic acid molecules when a target 
.olecule is used as a template having internal se<^ences that 
anneal with primers. 
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Figure 8 illustrates a method to synthesise additional copies 
of the original nucleic acid strands using nucleic acid strands 
present in colonies. The newly synthesised strands are shown 
in solution but can be provided in immobilised form if desired. 

Figure 9 shows the PGR amplification of DNA from DNA found in 
the pre-formed DNA colonies. 

FIGURE 10 illustrates how secondary primers can be generated 
from DNA colonies. 

FIGURE 11 illustrates how secondary DNA colonies can be 
generated from secondary primers. 

FIGURE 12 illustrates how primers with different sequences can 
be generated from a surface functionalised with existing 
primers. 

FIGURE 13 depicts methods of preparing DNA fragments suitable 
for generating DNA colonies. 

FIGURE 14 illustrates a method for synthesising cRNA using the 
DNA colony as a substrate for RNA polymerase. 

. FIGURE 15 illustrates a preferable method to determine the DNA 
sequence of DNA present in individual colonies. 

FIGURE 16 illustrates a method of determining the sequence of a 
DNA colony, de novo. 

FIGURE 17 illustrates the utility of secondary DNA colonies in 
the assay of mRNA expression levels. 

FIGURES 18 and 19 illustrates the use of the secondary DNA 
colonies in the isolation and identification of novel and rare 

expressed genes. 
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Of 



Fiaure la), a surface is provided having attached 
Referrinq now to Figure lei/, a 

I Z I olurality of primers (only one primer is shown for 
thereto a plurality P attached to the surface by a linkage 

simplicity) . Each primer (1) is attachea „on-covalent 
indicated by a dark block. This may be a covalent or a 
TZt L\.o... be sufficiently strong to keep a primer in place 

surface The primers are shown having a short nucleotide 
CenceT; -MT) • - practice however longer se^ences would 

generally be provided. 

Picure lb) shows a target molecule (II) that has annealed to a 

The target molecule comprises at its 3' end a sequence (5 
primer. The target muxc /pr-ATTl At its 

Itt, that is complementary to the primer -^-J ^^^f 
5' end the target molecule comprises a sequence (5 ATT) tha 
! J 1 the primer sequence (although e.act identity is not 
required) . 

^.n.ion is complete, as shown in Figure Id), it can be 
When primer extension is compie , provided that is 

,.h=.h extended immobilised strand (III) is provia 
seen that an exten ^^^^^^ ^^^^^^^^ ,,3, 

complementary to the target heating, 

^ J (T^^m 1-he extended immobilised strana ve.y. 
be separated from the exten extended, 
.s Shown in Figure le)) • This separation step 
l^nobilised strand so that it can then be used to it ate 
subsequent round of primer extension, as shown in Figures 
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. w n^Brf Strand bends over so that one end of 
Here the extended, .rnmobilised strand 

.Hat strand <havin, the terminal -^-^t ^ 

another pri.er (2, 5' -ATT), as shown .n ^^J^^ ' /^^^ 
•^o. 3- end from which primer extension can occur, ^ 

provides a J ena 1.1. ^--m ^ ^t^i-c. Primer 

extension is shown occurring in Figure ig) 

Figure Ih) . 

. u, .ho-, th. two o«,n«=. l-»biU==d strands that ™» shown 
i„ Figure Ihl "P* t„,„„i.„ b. u=.d .s t. .^UMs ih 

.„„h.. .o„„as o. P'-" ,h,io «„ha.d, 

«o.iu,oa st„nds c.h bo ' „ i„ 

?"°-ur 'r:rrr;::s:::hL;%o/„.poh.. . .h. 
.-o h.,. rp 1 . ,i.o. 

complement can De pr distributed within a colony, 

can be substantially homogeneously distribut 

used to initiate several colonies. 

• ^.d that the present invention allows very high 

immobilised molecule. Thus po , within one molecule 

one molecule length of position 1; position 1 is 
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length of position 2; and position 2 is within one molecule length of 
position 4. 

Figure 2 is provided to illustrate how colony growth can occur (using 
the method described with reference to Figure 1 and to Figure 6 or 
any other method of the present invention for providing immobilised 
nucleic acid molecules). 

A flat plate is shown schematically in plan view having primers 
i.mobilised thereon in a square grid pattern (the primers are 
indicated by small dots) . A regular grid is used solely for 
simplicity: in many real cases, the positions of the primers might 
indeed be less ordered or random. 

At the position indicated by arrow X a template molecule has annealed 
to a primer and an initial bout of primer extension has occurred to 
provide an immobilised, extended nucleic acid strand. Following 
strand separation, an end of that strand becomes free to anneal to 
further primers so that additional immobilised, extended nucleic acid 
strands can be produced. This is shown having occurred sequentially 
at positions indicated by the letter Y. For simplicity, the primer 
chosen for annealing is positioned next to the primer carrying the 
nucleic acid strand: in real cases, the nucleic acid strand could 
anneal with a primer which is not its next nearest neighbour. 
However, this primer will obviously be within a distance equal to the 
length of the nucleic acid strand. 

It will be appreciated that annealing at only one (rather than at 
all) of these positions is required for colony cell growth to occur. 

After immobilised, extended, single-stranded nucleic acid molecules 
have been provided at the positions indicated by letter Y, the 
resultant molecules can themselves anneal to other primers and the 
process can be continued to provide a colony comprising a large 
number of immobilised nucleic acid molecules in a relatively small 



area. 
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J„«„in, cvcU. It .... -Pic.= the 'VP---"' ° = « 
^ .ad., as ca„ ba sea„ on the .x.»pla= a^o™ .n 
Tha aimuuanaoua an^UHoation and i™oblli..tion of noclexc ac ds 
"InHoUd Phas. P.«ar, .as bacn s„oc.sa..U. achiavad t.a 
prooadura dasctibad in Exa.plas 1, 2 and 3 b.low: 



EXAMPLE 1 



at 1-heir 5' -termini (Microsynth 

r Tr rn."::: irL^o... p^a.. .o.o.„. 

GmbH, Switzeri The sequence of the oligonucleotide 

wells (Nunc, ^f^; ^/^^ ' ^ , 

: :::ence s^.^^^^a^cco 

p58 corresp 59 ^^^^ prepared as follows. In each 

Microtitre we s w th p 7 or p ^^^^^^ .ugonucleotide in 

Nucleoliak well 30 ^^j' chemicals, St.Louis, MO) was 
10 mM l-methyl-iroidazole (pH 7.0) ti.igm 

,adad. TO aaob -.11, I« .1 o. ,0 "/j^ ^^^^,,.,,, ,o ^ 

.i.«.,l..i„op,op,l, * f , „ „u,on„olaotua.. 

l.„thyl-l»ld.«la. waa addad to th. ao „„„i,^i. 
... .a„ tban a Ud -"^^^^^ 

the incubation, wells were rinse mcubated 15 minutes 

nraH^'pH,.., »aa, «.» .waan .0, ..baa -a„ d.ad 

.t 50-C and -aca sto.ad In a aa.lad plastic bag at 4 C. 

« V AGAAGGAGAAGGAAAGGGAAAGGG and terminated at the 3 end 
sequence 5 "^^^^J^^^ „,,„,,eCTT the four dNTPs (0.2 

with the sequence Boehringer-Mannheim, Germany) , 

itW), 0.1% BSA (bovine serum albumin, Boehringe 

Tween 20. 8% DMSO (dimethylsulf oxide, Flu.a Chemicals. 

Switzerland) . IX Ampiitaq priming reaction 

DNA polymerase (Perkin Elmer, Foster City, CA) . The p . 
was a sfngle round of PCK under the following conditions 94 C 
minutes, eO'C for 30 seconds and 72'C for 45 seconds in a 
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■ r^u 1 q ImM EDTA was used in three succea;>xvc 

by incubating the sealed ^^^^^ gj-C for 2 

repetitions the following conditions. 94 C tor * 

Tutes, for 4S seconds. After completion of this program, the 

wells were kept at S'C until further use. 

A 640 base pair fragment corresponding to the central sequence of the 
A 640 base pai 5 . .agAAGGAGAAGGAAAGGGAAAGGG sequence) 

. Prime-it II labelling kit (Stratagene, San 

(Amersham, UK) using the Prime it 11 i 
Diego, CA) to generate a biotinylated probe. 

^<intfd in to a concentration of 2.5 nM in 
The biotinylated probe .'^^^^^^^J^" ,3 ,,3 hybridized to 

EasyHyb (Boehringer-Mannheim, Germany) and lb P 
each sample with the following temperature scheme (PTC 200 

1 . q4-C for 5 minutes, followed by 500 steps of 0.1 C 
thermocycler : 94 C fo 5 . ^^^^^^ ^^^^^ ^^^^^^ 

decrease in ^e-P^^ture every ^^^^^^ ^^^^^^^ 

"""r"s"edTsToU w 11 With 2X SSC/O.U SOS ,2X SSC; 0.3M 
are then washed J^''^ 0/0.001 mg/ml sodium dodecyl sulfate) 
NaCl/0.03M sodium citrate pH7^0/0 g ^^^^ ^.^^ 

at room temperature, once with 2X SSC/0.1% 

"■--='°;r.:ro/:::-— — ^^^^^ 
- " — ^^^^ rrrr::rti- 

counter (WALLAC, Turku, Finland). 
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Excess FluoSpheres*- are removed by washing for 30 min in 
Excess fciu p ^,„,,rature Images of the stained samples are 

TNT/0.1%BSA at room temperature. iin«*y . 
oLL-d using . 20X objective on .n inverted .ictoscope ,--xiO,e« 
V, C..i Lias A=, <»>e.KocKen, «i~n„ e,^ipped with , 
e ceo ce.et. .Princeton instr»,ents, Trenton, thron,K . 
xn3 filter set ,PBM6/FT580/I.P590, C-e,. Optical, Br.ttl.boro, VT, 
with a 5 second exposure. 

Pigure 4 Shows the hybridisation results for ^^^V generation on 

- Tr;;::r rt^ :hr ^^^^ — t 

tr^^~ - the nan.i. r.ions on the te.iate . 
not correspond to the primer sequences grafted onto the well. . 
contrast figure 4b shows the nunO^er of fluorescent spots detected 
contrast, figu flanking sequences on 

rrirtinr; r::::,::. celcutm, tne n^e. o. .morescent 
,p t detecte! „d te.ln, into consid.r.tion the .e,n„ic.tion. «e 
L esti^te t.t t.re ere — / ^ ^ ^ ' 

fnire^sTrrtorL; -!:. Uronn. end intensities nor^lised 

to the same values. 
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B) Scheme Showing The Simultaneoug Amplification And 

Immobilisation Of Nucleic Acid Molecules Os inq Two Different 
Types of Primer 

Referring now to Figure 5, another embodiment of the present 
invention is illustrated. Here two different immobilised primers are 
used to provide primer extension. 



In this embodiment the target molecule shown is provided with a 
nucleotide sequence at its 3' end (AAT-3' ) that is complementary to 
the sequence of a first primer, (5' -ATT, I), which is grafted on the 
surface, so that annealing with that primer can occur. The sequence 
(5'-GGT) at the 5' end of the target molecule, III, corresponds to 
the sequence (5'-GGT) of a second primer, II, which is also grafted 
to the surface, so that the sequence which is complementary to the 
sequence at the 5' end can anneal with that said second primer. 
Generally said complementary sequence (5'-ACC) is chosen so that it 
will not anneal with the first primer (5' -ATT). Unlike the situation 
described in section A, once the 3' end of a newly synthesised strand 
anneals to a primer on the surface, it will have to find a primer 
whose sequence is different from the sequence it carries at its 5' 
end (see the difference between Figures If and 5f ) . 



The embodiment shown in Figure 5 has an advantage over the embodiment 
illustrated in. Figure 1 since the possibility of one end of a single 
stranded target molecule annealing with another end of the same 
molecule in solution can be avoided and therefore amplification can 
proceed further. The possibility of annealing occurring between both 
ends of an immobilised complement to a target molecule can also be 

avoided. 
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EXAMPIJ! 2 



A mix Of two oligonucleotides which are phosphorylated at the 5' -end 
(Microsynth GmbH, Balgach, Switzerland) have been grafted on 96 well 

Nucleolink plates (Nunc, Denmark) as recommended by the manufacturer. 

The resulting plates has been stored dry at 4-C. The sequence of the 

primer, PI, was 5 ' -GCGCGTAATACGACTCACTA, the sequence of the other 

primer, P2, was 5' -CGCAATTAACCCTCACTAAA. These plates are specially 
formulated by Nunc, allowing the covalent grafting of 5'- 

phosphorylated DNA fragments through a standard procedure. 

A template has been cloned in a vector (pBlueScript Skminus, 
Stratagene Inc, San Diego, CA) with the appropriate DNA sequence at 
the cloning site (i.e., corresponding to PI and P2 at position 621 
and 794 respectively) , and 174 bp long linear double stranded DNA 
template has been obtained by PGR amplification, using PI and P2. The 
template PGR product has been purified on Qiagen Qia-quiclc columns 
(Qiagen GmbH, Hilden, Germany) in order to remove the nucleotides and 
the primers used during the PGR amplification. 

The purified template (in 50 pi solution containing IX PGR buffer 
(Perkin Elmer, Foster Gity, CA) with the four deoxyribonucleoside 
triphosphates (dNTPs) at 0.2 mM, (Pharmacia, Uppsala, Sweden) and 2.5 
units of AmpliTaq Gold DNA polymerase (Perkin Elmer, Foster City, 
CA)) has been spread on the support, i.e. on the Nucleolink plates 
grafted with PI and P2 (the plates have been rinsed with a solution 
containing 100 mM TRIS-HGl (pH 7.5), 150 mM NaCl and 0.1% Tween 20 
(Fluka, Switzerland) at room temperature for 15 min) . This solution 
has been incubated at 93'C for 9 minutes to activate the DMA 
polymerase and then 60 cycles (94-C/30 sec.,48-C/30 sec.,72'C/30 
sec.) have been performed on a PTC 200 thermocycler . Several 
different concentrations of PGR template have been tested 
(approximately 1, 0.5, 0.25, 0.125, 0.0625 ng/uD and for each sample 
a control reaction carried out without Taq polymerase has been 
performed (same conditions as above but without DNA polymerase) . 
Each sample has been stained with YO-PRO (Molecular Probes. Portland 
OR), a highly sensitive stain for double stranded DNA. The resulting 
products have been observed on a confocal microscope using a 40X 
objective (LSM 410, Carl Zeiss AG, Oberkochen, Germany) with 
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appropriate excitation (an 488 argon laser) and detection filters 
(510 low pass filter) (note: the bottom of each well is flat and 
allows observation with an inverted fluorescence microscope). 

In figure 6A, the control well (without added DNA template, panel a) 
shows only rare objects which can be observed on a bian)c surface 
(these objects were useful at this stage for reporting that the focus 
was correct) . These objects have an irregular shape, are 20 to 100 
micro-meters in size and have a thickness much larger than the field 
depth of the observation. In a well where DNA polymerase was present 
(figure 6A, panel ii) , in addition to the objects of irregular shape 
observed in the control well, a great number of fluorescent spots can 
be observed. They present a circular shape, they are 1 to 5 micro 
meters in size and do not span the field of view. The number of spots 
depends on the concentration of the template used for initiating 
colony formation. From the observed size of the colonies, one can 
estimate that more than 10,000 distinct colonies can be arrayed 
within 1 mm^ of support. 



EXAMPLE 3 

Oligonucleotides (Microsynth GmbH, Switzerland) were grafted onto 
NucleolinJc wells (Nunc, Denmark) . Oligonucleotide PI corresponds to 
the sequence 5' -TTTTTTCTCACTATAGGGCGAATTGG and oligonucleotide P2 
corresponds to 5' -TTTTTTCTCACTAAAGGGAACAAAAGCTGG. In each Nucleolink 
well, a 45 ^1 of 10 mM 1-methyl-imidazole (pH 7.0) (Sigma Chemicals, 
St. Louis, MO) solution containing 360 fmol of PI and 360 fraol of P2 
was added. To each well, 15 pi of 40 mM l-ethyl-3- (3- 
dimethylaminopropyD-carbodiimide (pH 7.0) (Sigma Chemicals) in 10 mM 
1-methyl-imidazole, was added to the solution of oligonucleotides. 
The wells were then sealed and incubated at 50°C for 16 hours. 
After the incubation, wells have been rinsed twice with 200 |il of RS 
(0,4N NaOH, 0.25% Tween 20), incubated 15 minutes with 200 pi RS, 
washed twice with 200 pi RS, and twice with 200pl TNT (lOOmM Tris/HCl 
pH7.5, 150mM NaCl, 0.1% Tween 20), before they are put to dry at 50'*C 
in an oven. The dried tubes were stored in a sealed plastic bag at 
4°C. 
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colony growing was initiated in each well with 15 pi of initiation 
mix (IX PGR buffer, 0.2 mM dNTPs and 0.75 units of AmpliTaq Gold DNA 
polymerase, 20 nanograms of template DNA, where the template DNA was 
either SI DNA or S2 DNA or a mixture of different ratios of SI DNA 
and S2 DNA, as indicated in discussion to figure 6B. 31 and S2 are 
704 base pair and 658bp fragments, respectively, which have been 
cloned into pBiueScript Skminus plasmids and subsequently amplified 
through a PGR using PI and P2 as primers. The fragments were purified 
on Qiagen Qia-quick columns (QIAGEN GmbH, Germany) in order to remove 
the nucleotides and the primers. 

Each well was sealed with Cycleseal™ (Robbins Scientific Corp., 
Sunnyvale, CA) , and incubated at 93 ^C for 9 minutes, 65'C for 5 
minutes and 72-G for 2 minutes and back to 93**C. Then 200 pi TNT 
solution was used in three successive one minute long washes at 93"C, 
The initiation mix was then replaced by 15 pi growing mix (same as 
initiation mix, but without template DNA), and growing was performed 
by incubating the sealed wells 9 minutes at 93 'C and repeating 40 
times the following conditions: 93**C for 45 seconds, GS'^C for 3 
minutes, 72 *C for 2 minutes. After completion of this program, the 
wells were kept at 6**C until further use. The temperature control was 
performed in a PTC 200 thermo-cycler, using the silicon pad provided 
in the Nucleolink kit and the heated (104^0 lid of the PTC 200. 

A 64Qbase pair fragment corresponding to the central sequence of the 
SI fragment, but not including the PI or P2 sequence was amplified by 
PGR as previously described. The probe was labelled with biotin-16- 
dOTP (Boehringer-Mannheim, Germany) using the Prime-it II random 
primer labelling kit (Stratagene, San Diego, CA) according to the 
manufacturers instructions. 

The biotinylated probes were hybridized to the samples in EasyHyb 
buffer (Boehringer-Mannheim, Germany), using the following 
temperature scheme (in the PTC 200 thermocycler) : 94'C for 5 minutes, 
followed by 68 steps of 0.5'C decrease in temperature every 30 
seconds (in other words, the temperature is decreased down to 60 °C 
in 34 minutes), using sealed wells. The samples are then washed 3 
times with 200pl of TNT at room temperature. The wells are then 
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incubated for 30 minutes with 50 ^1 TNT containing 0.1 mg/mi BSA. 
Then the wells are incubated 5 minutes with 15pl of solution of red 
fluorescent, Neutravidin-coated, 40 nm FluoSpheres**^* (580 nm 
excitation and 605 nm emission/ Molecular Probes, Portland, OR) . The 
solution of microspheres is made of 2 pi of the stock solution of 
microspheres, which have been sonicated for 5 minutes in a 50 W 
ultra-sound water-bath (Elgasonic, Bienne, Switzerland), diluted in 1 
ml of TNT solution containing 0.1 mg/ml BSA and filtered with Millex 
GV4 0.22 \m pore size filter (Millipore, Bedford, MA). 

The stained samples are observed using an inverted Axiovert 10 
microscope using a 20X objective (Carl Zeiss AG, Oberkochen, Germany) 
equipped with a Micromax 512x768 CCD camera (Princeton Instruments, 
Trenton, NJ) , using a XF43 filter set (PB546/FT580/LP590, Omega 
Optical, Brattieboro, VT) , and 10 seconds of light collection. The 
files are converted to TIFF format and processed in the suitable 
software (PhotoPaint, Corel Corp., Ottawa, Canada). The processing 
consisted in inversion and linear contrast enhancement, in order to 
provide a picture suitable for black and white print-out on a laser 
printer. 

The figure 6B shows the results for 3 different ratios of the S1/S2 
templates used in the initiating reaction: i) the S1/S2 is 1/0, many 
spots can be observed, ii) the S1/S2 is 1/10, and the number of spots 
is approximately 1/10 of the number of spots which can be observed in 
the i) image, as expected, and iii) the S1/S2 is 0/1, and only a few 
rare spots can be seen. 
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C Scheme Showing Simultaneous Amplification And iBunobilisation Of 

Nucleic Acid Molecules When The Target Molecule Contains 
Internal Sequences Complementary To The Immobilised Primers 

Figure 7 is provided to show that the sequences shown at the 5' and 
3' ends of the target molecule illustrated in Figures 1 and 5 need 
not be located at the ends of a target molecule. 

A target nucleic acid molecule (II) may have a sequence at each (or 
either) end that is neither involved in annealing with a primer nor 
in acting as a template to provide a complementary sequence that 
anneals with a primer (sequence 5' -AAA and sequence 5'-CCC). One of 
the internal sequences (5'-AAT) is used as a template to synthesise a 
complementary sequence, III, thereto (5'-TTT), as is clear from 
figures 7(a) to 7(e) . 

The sequence 5'-TTT is not however itself used to provide a sequence 
complementary thereto, as is clear from figures 7(f) to 7(k). It can 
be seen from Figure 7(1) that only one of the four immobilised 
strands shown after two rounds of primer extension and a strand 
separation step comprises the additional sequence 5'-TTT and that no 
strand comprising a complementary sequence (5' -AAA) to this sequence 
is present (i.e. only one strand significantly larger than the others 
is present) . After several rounds of amplification the strand 
comprising the sequence 5'-TTT will represent an insignificant 
proportion of the total niimber of extended, immobilised nucleic acid 
molecules present. 
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D) Using Wucleic Acid Strands Preaant in Colonies to Synthesise 
Additional Copies of Nucleic Acid Strands 

Amplified, single stranded nucleic acid molecules present in colonies 
provided by the present invention can themselves be used as templates 
to synthesise additional nucleic acid strands. 

Figure 8 illustrates one method of synthesising additional nucleic 
acids using immobilised nucleic acids as a starting point. 

Colonies will usually comprise both a given nucleic acid strand and 
its complement in immobilised form (figure 8a) . Thus they can be 
used to provide additional copies not only of a given nucleic acid 
strand but also of its complement. 

One way of doing this is to provide one or more primers (primers TTA 
and TGG) in solution that anneal to amplified, immobilised nucleic 
acid strands present in colonies (figure 8c) provided by the present 
invention. (These primers may be the same as primers initially used 
to provide the immobilised colonies, apart from being provided in 
free rather than immobilised form. ) The original DNA colony is 
denatured by heat to it single-stranded form (figure 8b) . allowing 
primers TTA and TGG to anneal to the available 3' end of each DNA 
strand. Primer extensionrx using AmpliTaq DNA polymerase and the four 
deoxyribonucleoside triphosphates (labelled or unlabeled) can then be 
used to synthesise complementary strands to immobilised nucleic acid 
strands or at least to parts thereof (step (iii) ) . 

Once newly formed strands (figure 8d) have been synthesised by the 
process described above, they can be separated from the immobilised 
strands to which they are hybridised (e.g. by heating). The process 
can then be repeated if desired using the PGR reaction, to provide 
large number of such strands in solution (figure 8e) . 



32 



SUBSTITUTE SHEET (RULE 26) 



wo 98/44151 



PCT/CB98/00961 



Strands synthesised in this manner, after separation from the 
immobilised strands, can, if desired, be annealed to one another 
(i.e. a given strand and its complement can anneal) to provide 
double-stranded nucleic acid molecules in solution. Alternatively 
they can be separated from one another to provide homogenous 
populations of single-stranded nucleic acid molecules in solution. 



It should also be noted that once single-stranded molecules are 
provided in solution they can be used as templates for PGR (or 
reverse PGR) . Therefore it is not essential to continue to use the 
immobilised nucleic acid strands to obtain further amplification of 
given strands or complementary strands thereto. 

It should be noted that where a plurality of colonies are provided 
and nucleic acid strands in different colonies have different 
sequences, it is possible to select only certain colonies for use as 
templates in the synthesis of additional nucleic acid molecules. 
This can be done by using primers for primer extension that are 
specific for molecules present in selected colonies. 

Alternatively primers can be provided to allow several or all of the 
colonies to be used as templates. Such primers may be a mixture of 
many different primers (e.g. a mixture of all of the primers 
originally used to provide all of the colonies, but with the primers 
being provided in solution rather than in immobilised form) . 



EXAMPLE 4 

Oligonucleotides (Microsynth GmbH Balgach, Switzerland) were grafted 
onto Nucleolink wells (Nunc, Denmark) . Oligonucleotide PI 
corresponds to the sequence 5' -TTTTTTTTTTCACCAACCCAAACCAACCGAAACC and 
oligonucleotide P2 corresponds to 5'- 

^^^^^TTTTTAGAAGGAGAAGGAARGGGAAAGGG. In each Nucleolink well, a 45 pi 
of 10 mM l-methyl-imidazole (pH 7.0) (Sigma Chemicals) solution 
containing 360 fmol of PI and 360 fmol of P2 was added. To each 
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well, 15 pl of 40 mM i-ethyl-3- (3-dimethylaminopropyl ) -carbodiimide 
(pH 7.0) (Sigma Chemicals) in 10 mM 1-methyl-imidazole, was added to 
the solution of oligonucleotides. The wells were then sealed and 
incubated at 50 "C for 16 hours. After the incubation, wells have 
been rinsed twice with 200 yxl of RS (0.4N NaOH, 0.25% Tween 20), 
incubated 15 minutes with 200 pl RS, washed twice with 200 pl RS, and 
twice with 200pl TNT (lOOmM Tris/HCl pH7.5, ISOmM NaCl, 0.1% Tween 
20), before they are put to dry at 50°C in an oven. The dried tubes 
were stored in a sealed plastic bag at 4''C. 

Colony growing was initiated in each well with 15 pl of initiation 
mix (IX PGR buffer, 0.2 mM dNTPs and 0.75 units of AmpliTaq DNA 
polymerase, 20 nanograms of template DNA, where the template DNA was 
either 31 DNA or S2 DNA or a 1/1 mixture of SI DNA and S2 DNA, as 
indicated in discussion to Example 3. SI and S2 are 658 base pair 
and 704 b.p. fragments, respectively, which have been prepared as 
described in EXAMPLE 3. 

Each well was sealed with Cycleseal™ (Robbins Scientific Corp., 
Sunnyvale, CA) and incubated at 93°C for 9 minutes, 65"C for 5 
minutes and 72«C for 2 minutes and back to 93-C. Then 200 pl TNT ^ 
solution was used in three successive one minute long washes at SS'C. 
The initiation mix was then replaced by 15 pl growing mix (same as 
initiation mix, but without template DNA) and growing was performed 
by incubating the sealed wells 9 minutes at 93"C and repeating 40 
times the following conditions: 93'C for 45 seconds, 65*C for 3 
minutes, 72'C for 2 minutes. After completion of this program, the 
wells were kept at e^C until further use. The temperature control 
was performed in a PTC 200 the rmo- cycler . 

Different treatments where applied to 6 sets (A,B,C,D,E and F)of 3 
wells (1,2,3), one prepared with template SI, one with template SI 
and template S2 and one prepared with template S2 alone (yielding 
A1,A2,A3,...,F1,F2,F3) . The set A was left untreated, set B has been 
incubated for 10 minutes with BAL-31 exonuclease (New England 
Biolabs, Beverly, MA) at 37'C in BAL-31 buffer (BAL-31 essentially 
digests double stranded DNA which has both ends free) , set C has been 
incubated for 10 minutes with SI nuclease (Pharmacia, Uppsala, 
Sweden) at 37''C in SI buffer (SI nuclease essentially digests single 
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stranded DNA) , set D, E and F have been incubated with both BAL-31 
and SI nucleases. Reactions were stopped by rinsing the wells with 
TNT buffer. 



PCR (25 cycles, 30 sec. at 94-C , 45 sec. at 60'C . 45 sec. at 72-C) 
has been performed in the Nucleolink wells with 0.25 pM primers P70 
(5'-CACCAACCCAAACCAACCCAAACCACGACTCACTATAGGGCGAA) and P71 (5'- 
AGAAGGAGAAGGAAAGGGAAAGGGTAAAGGGAACAAAAGCTGGA) in solution in sets A, 
B, C and D. P70 and P71 are suited for the amplification of both SI 
and S2, since primer P70 contains the sequence of primer PI and p71 
contains P2. In the set E wells, PCR has been performed with a set 
of forward (P150, 5 ' -GGTGCTGGTCCTCAGTCTGT ) and reverse {P151, 5'- 
CCCGCTTACCAGTTTCCATT) primers which are within SI and not within S2 
so as to produce a 321 bp PCR product, and in the set F wells, PCR 
has been performed with a set of forward {P152, 5'- 
CTGGCCTTATCCCTAACAGC) and reverse (P153, 5' -CGATCTTGGCTCATCACAAT) 
primers which are within S2 and not within SI so as to produce a 390 
bp PCR product. For each of the 18 PCR reactions, 3 pi of solution 
have been used for gel electrophoresis on 1% agarose in presence of 
0.1 (ig/ml ethidium-bromide. The pictures of the gels are presented 
in Figure 9. These pictures show that DNA in the colonies is 
protected from exonuclease digestion (sets B, C and D as compared to 
set A) , and that both SI and S2 can be recovered either 
simultaneously using PI and P2 (sets A, B, C and D) or specifically 
(set E and F) . In set E and F, where the shorter PCR products are 
more efficiently amplified than the longer PCR products in sets A, B, 
C, D, a cross-contamination between the SI and S2 templates is 
detectable (see lane E2 and Fl) . 
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P | Provi?^"" °f Seconrt ^ry Colonias 

It is also possible to modify initially formed colonies to provide 
f r en cLonies (i.e. to provide colonies comprising i^nobUxsed 
1 c acid molecules with different sequences from t.ose molecules 
nt in the initially formed colonies) . Here, the inxtxally 
present in the initia y -primary colonies" and the later 

will be suitable for secondary colony generation. 

Pi,ure 10 Shows how 'secondary primers' are generated using existing 

imy colonies. As a starting point, the primary colony (figure 
1 aris left in the fully hybridised, double-stranded form. A 

ll strand specific DNA exonuclease might be used to remove all 
r's i" Tave not been elongated. One could also choose to cap 
all free 3' -OH ends of primers with dideoxyribonucleotide 
triphirphates using a terminal transferase (step (i). fi.ure 
10b) . 



^loavpd bv using endonucieases . cv^j. t- 
be cl«v=d by g _j,i<, »Ul.in th= colon, (depicted by 

:re"'«."::rr:n z:\.^ cu..es ^ co.. ...p .u,, 

Ibd) can then be partially digested with a 3 to 5 do 
pe ific exonuclease (e.g. ..coli exonuclease III, depicted by H , 
steriii) figure lOd). In any case, the secondary primers are 
step (111), bv heat) and washing (figure 

available after denaturation (e.g., by heat) 



lOe) 



«„„.avel„ «e do„bU st„„ded fo^in, Ue colo.Us ,fl,c" 

..^^ uiith the double- St rand specific J ^ 

exonuclease, which digest oniy ,„ , fe„ bases of 

important case is when the exonuclease digests only a few 



An 
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the DNA molecule before being released in solution, and when 

estion can proceed when another en.y.e binds to the 0.. .o ecu e 
(figure log, . In this case the exonuclease digestxon w.ll proceed 
Intil there re.ain only single stranded molecules which, on average, 
are half the length of the starting material, and are without any 
complementary parts (which could form partial duplexes, rema.nxng .n 
the single stranded molecules in a colony (figure lOh) . 



I„ all cases, these treatments result in single-stranded fragments 
grafted onto a support which correspond to the sequence of the 
original template and that can be used for new DNA colony growxng .f 
an appropriate new template is provided for colony initiation 

(figures lOe and lOh) . 

The result of such a treatment, thus a support holding secondary 
prirners, will be referred to as a "support for secondary colony 
g 0^". Templates useful for secondary colony growing may inclu e 
Tie ules having .nown seguences ,or complements of — ^ 

Alternatively templates may be derived from unseguenced molecules 
'e.g. random fragments,. In either event the templates shoul be 

provided with one or more regions for annealing with nucleic acid 

strands present in the primary colonies. 

Figure 11 a-e show how a secondary colony can be generated when an 

o riate template (TP, figure na)is provided for a second round 
of DhI colony generation on a support for secondary colony growing, 
hlllg secoLary primers. In this example, treatment of the primary 
colony as described above has generated the secondary primers, 
and SP2 (figure 11a). The template TP, will hybridise to its 
complementary secondary primer, SPl, and following an «t-sxon 
reaction using a DNA polymerase as described, will be extended as 
Tpicted (figure 12b,. following a denaturing (step ii, , re.nneaUng 
(Step iii) and DNA polymerase (step iv, cycle, a replica of the 
original primary colony will be formed (figure He). 
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The n.axi«u. size of a secondary colony provided by thxs embodxmen o 
the present invention is restricted by the size of the primary colony 
onto which it grows. Several secondary growing processes can be used 
sequentially to create colonies for specific applications (.^e a 
first colony can be replaced with a second colony, the second colony 
can be replaced with a third colony, etc.) 
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F) 



D^««i«ion of Extendfld Primers 



Figure 12 shows how extended primers can be generated on an array of 
oligonucleotides. The same procedure could be applied to a support 
covered with colonies or secondary primers as described in section E. 

in Figure 12a) a support is provided having a plurality of 
immobilised primers shown thereon. Different immobilised primers are 
shown present in different regions of the support (represented by 
squares). Primers having the sequence 5' -AAA are present in one 
square and primers having the sequence 5' -GGG are present in another 
square. 

Figures 12b) to 12d) show how the initial primers present (initial 
primers) are modified to give different primers (extended primers), 
in this example, those initial primers having the sequence 5' -AAA are 
modified to produce two different types of extended primers, havxng 
the sequences 5' -AAAGCC and 5'-AAATAC respectively. This xs achieved 
.y the hybridisation of oligonucleotide templates. S'-GTATTT and 5 - 
GGCTTT to the primary primers immobilised on the surface (figure ^ 
12b), followed by DNA polymerase reaction. Those initial primers 
having the sequence 5'-GGG are modified to produce two different 
types of extended primers, having the sequences 5'-GGGTAT and 5 - 
GGGTAA (figure 12d) in a similar manner. 

The technique of producing extended primers is useful for 
transforming immobilised oligonucleotides provided on a DNA chip or 
other surface into immobilised primers useful in amplifying a 
particular target nucleic acid sequence and/or in amplifying a 
complementary strand thereto. 
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r.i Preparation of nMC ^-^" fragments 



Apparatuses of the present invention can be used for varxous 
pfocedures so.e of which will be described later on. Nucle.c ac d 
fragments for use in colony generation may be prepared differently 
ToHhe different procedures (referred to herein as "prepared nucle.c 
acids") . various preparation procedures are described below: 



(i) 



Preparation of random DVK fragments 



Here is described a method to prepare DNA originating from one 
.iological sample (or from a plurality of samples, for an.plif---n 
in the case where it is not necessary to .eep tracK of the orxg.n of 
the DNA when it is incorporated within a colony. 

The DKA of interest is first extracted from the biological sample and 
cut randomly into "small" pieces (e.g., 50 to 10,000 bases long, but 
preferentially SCO to 1000 base pairs in length, represented by^ar 

figure 13a) . (This, can be done e.g., by a phenol-chloroform 
extraction followed by ultrasound treatment, mechanical shearing, by 
plial digestion with frequent cutter restriction endonucleases or 
other methods Known by those skilled in the art, . - 
standardise experimental conditions, the extracted and cut DKA 
fragments can be size-fractionated, e.g., by agarose gel 
electrophoresis, sucrose gradient centrifugation or gel 
c rltography. Fragments obtained within a single ^^^^ 
used in providing templates in order to reduce the varxabxUty xn 
Size of the templates. 

secondly, the extracted, cut and (optionally, ported 

£„^e„t. can be U,ated -Itn ollgonacleotid. Unlets ,11. and lib, 

13a, contalnin, tbe ..<^ence o£ the pti^rla, «hxch have 
p riously been ,r..ted onto a auppott. ,hla can be - 
Ltanca, usln, -blunt-end- ligation. Alternatively, the te.plate 
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1. fl.nkea by the =eq«n» ol th. prln,«. that a.a grafted on the 
p o t ThL Clohed .™ =.» he ..pUUe. -ithih a hiol=,lcal ho,t 

raoted. ohviou.l,, i. o„. «o..in, with . si„,l. Pt.net 
"!<te. to the .olid .„ppott toi D»A oolohy .o„.tio„, P^'^Vin^ 
i"U«s cohtaihih, both Pi a„d P2 pti.et, doe. hot pose a p,:obl«.. 
„etea«.i, the .t.^»ts obtaihed aft.t .ooh ' 
are designated by th. expr.saioh . "prepared ge.o.io D»A (IH, 
figure 13a) . 

(ii) Preparation of random D«A fragments originating from a 
plurality of saaplas 

Here it is described how to prepare WA origihatin, fro» a Pl»"l"' 
oHiriogicai s«^les Ih the case where it is n.cess.ry to Keep tracK 
o, the Irigin of the wheh it i. ihcorporated -ithih a coiohy. 

- -rt"::hrs^r ::eTi:,=^^^^^ 
:::T.hriyrtTeh::c'.» .r..e„ts are „o. 

segoehca o, the pri.ers grafted ' -h-^- J 

pnd a ^^taq" sequence which is dirrerenc 
h h. Lad for identifying the origin of the - 

„ote that for each sa.^1., the tag «y not be .hrgue. b.t a ploraii.y 
of tags could b. used. Hereafter, we will designate the D»A 
tg::^: Obtained after such . suitable process hy the expressron . 
"tagged genomic DNA" (HI, Figure 13b) . 

This tagging procedure can be used for providing colonies carrying a 
eans of identification which is independent from the sequence 
cedb be template itself. This could also be -^ful when so«e 
colonies are to be recovered specifically (using the procedure g.ve. 
: sec ion 0^ This could also be useful in the case the recovered 
colonies are further processed, e.g., by creating new primary 
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colonies and a cross reference between the original colonies and the 
new colonies is desired. 



(iii) Preparation of DHA fragments corresponding to a plurality of 
DNA sequences originating from one sample 



The DNA of interest can first be extracted from a biological sample 

by any means known by those skilled in the art (as mentioned supra) . 

Then the specific sequences of interest can be amplified with PGR 
(Step (i), figure 13c) using PGR primers (lla and lib) made of two 

parts; 1) at the 5' -end, the sequences corresponding to the sequences 
of primer oligonucleotide (s) that have been grafted onto a surface 
(PI and P2)and 2) at the 3' -end, primer sequences specific to the 
sequence of interest (SI and S2) . Hereafter, we will designate the 
DNA fragments obtained after such a suitable process by the 
expression: "prepared DNA" (III, figure 13c) . 

(iv) Preparation of a plurality of DHA fragments originating from a 
plurality of samples 

The procedure is the same as in the previous section except that in 
this case the DNA primers (Ila and lib) used to perform the PGR 
amplification (step (i), figure 13d) are now made of three parts; 1) 
the sequence of the primers grafted onto the surface (PI and P2), 2) 
a "tag" sequence which is different for each sample and which will be 
used for the identifying the origin of the DNA colony and 3) primer 
sequences surrounding the specific sequence of interest (SI and S2) . 
Note that for each sample, a plurality of tags might be used, as in 
(ii) supra. 

Hereafter, we will designate the DNA fragments obtained after such a 
suitable process by the expression : "tagged DNA" (HI. figure 13d) . 
Potential uses of tags are the same as in (ii) , supra. 
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(v) Preparation 



of mBNA 



The procedure is similar to the procedures described for preparing 
ONA fragments in the previous sections except that the starting poxnt 
is to extract mRNA by any .eans Known to those sKilled -^^^ 
,e q by use of conunercially available mRNA preparation Kits) . The 
.R^'can be copied into double-stranded cDNA by any means Known to 
: ose sKilled in the art (e.g. by using a reverse transcriptase and 
0.A polymerase, . Certainly, the tags and pri.ers described supra can 
be used in conjunction with the process of double-stranded cDNA 
synthesis to allow their incorporation into the templates. 
Leafter, we will designate the mKKA fragments obtained after such 

suitable processes by the expressions: "prepared total mKNA (cf . 

.^prepared genomic DMA", as described in section 

total mRNA". (cf . "tagged genomic DHA", as described in section i 
supra,, "prepared mRHA" (cf. "prepared DNA", as described in section 
aU, supra, and "tagged mKNA" (cf. "tagged OKA", as described m 

section (iv) supra) . 
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H) 



Preferred Detection Assays 



a) 



bl 



in assay procedures of the present invention labels may be used to 
provide detectable signals. Examples include: 

a fluorescent group or a energy-transfer based fluorescence 
system. 

a biotin based system. In this case colonies can be incubated 
with streptavidin labelled with a fluorescent group or an 
enzyme (e.g. fluorescent latex beads coated with streptavidin; 
streptavidin labelled with fluorescent groups; enzymes for use 
with the corresponding fluorescence assay) . 
-) a system based on detecting an antigen or a fragment thereof - 
e g a hapten (including biotin and fluorescent groups) . In 
this case colonies can be incubated with antibodies (e.g. 
specific for a hapten) . The antibodies can be labelled with a 
fluorescent group or with an enzyme (e.g. fluorescent latex 
beads coated with the antibody; antibodies labelled with 
fluorescent groups; antibodies linked to an enzyme for use wxth 
a corresponding fluorescence or luminescence assay, etc.). 
d, a radio-label (e.g. incorporated by using a 5 -polynucleotide 

kinase and lr--P] adenosine triphosphate or a DNA polymerase and 
(a-"P or a-»Pl deoxyribonucleoside triphosphates to add a 
radioactive phosphate group(s) to a nucleic acid) . Here 
colonies can be incubated with a scintillation liquid. 

e) a dye or other staining agent. 

Labels for use in the present invention are preferably attached 

a) to nucleic acids 

b) to proteins which bind specifically to double stranded DNA (e.g., 
histones, repressors, enhancers) 

and/or 

c) to proteins which bind specifically to single stranded DNA (e.g. 
single-stranded nucleic acid binding protein) . 
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Labelled colonies are preferably detected by: 

a) measuring fluorescence. 

b) measuring luminescence. 

c) measuring radioactivity 

d) measuring flow or electric field induced fluorescence anisotropy. 
and/or 

e) measuring the polymer layer thickness. 

Staining agents can be used in the present invention. Thus DNA 
colonies can be incubated with a suitable DNA-specific staining 
agent, such as the intercalating dyes, ethidium bromide. YO-YO. YO- 
PRO (Molecular Probes. Eugene, OR) . 

With certain staining agents the result can be observed with a 
suitable fluorescence imaging apparatus. 

Examples of particular assays/procedures will now be described in 

greater detail: 
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12 Preferred E">>odiments of Assays of the Present Invention 

(i) Nucleic acid probe hybridisation assay 

DNA colonies are first prepared for hybridisation. Then they are 
hybridised with a probe (labelled or unlabelled) . If required, the 
hybridised probed is assayed, and the result is observed. This can 
be done with an apparatus of the present invention (e.g. as described 
supra) . 

Preparation for hybridisation 

In a preferred embodiment of the present invention colonies are 
treated with a DNA restriction endonuclease which is specific either 
for a sequence provided by a double stranded form of one of the 
primers originally grafted onto the surface where colonies are formed 
or for another sequence present in a tenqplate DNA molecule (see e.g. 
figure 16c) . 



After restriction enzyme digestion, the colonies can be heated to a 
temperature high enough for double stranded DNA molecules to be 
separated. After this thermal denaturing step, the colonies can be 
washed to remove the non-hybridised, detached single-stranded DNA 
strands, leaving a remaining attached single-strand DNA. 

In'another embodiment the colonies can be partially digested with a 
double-strandspecific 3' to 5' DNA exonuclease (see section E, figure 
lOf) which removes one strand of DNA duplexes starting from the 3' 
end, thus leaving a part of a DNA molecule in single stranded form. 



Alternatively, DNA in colonies can first be heat denatured and then 
partially digested with an single-strand specific 3' to 5' DNA 
exonuclease which digests single stranded DNA starting from the 3' 
end. 
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A further alternative is simply to heat denature DNA in the colonies. 



Hybridisation of the probe 

Single-stranded nucleic acid probes (labelled or unlabelled) can be 
hybridised to single-stranded DNA in colonies at the appropriate 
temperature and buffer conditions (which depends on the sequence of 
each probe, and can be determined using protocols known to those 
skilled in the art) . 



Assaying of unlabelle d hybridised probes 

A hybridised probe provided initially in unlabelled form can be used 
as a primer for the incorporation of the different (or a subset of 
the different) labelled (or a mix of labelled and unlabelled) 
deoxyribonucleoside triphosphates with a DNA polymerase. The 
incorporated labelled nucleotides can then be detected as described 



supra . 



cyclic assaying of lab elled or unlabelled probes 

Firstly, the DNA colonies can be prepared for hybridisation by the 
methods described supra. Then they can be hybridised with a probe 

(labelled or initially unlabelled) . If required, hybridised labelled 
probes are assayed and the result is observed with an apparatus as 
described previously. The probe may then be removed by heat 

denaturing and a probe specific for a second DNA sequence may be 

hybridised and detected. These steps maybe repeated with new probes 

as many times as desired. 

Secondly, the probes can be assayed as described supra for unlabelled 
probes, except that only a subset (preferably 1 only) of the 
different (labelled or unlabelled) nucleotides are used at each 
cycle. The colonies can then be assayed for monitoring the 
incorporation of the nucleotides. This second process can be 
repeated until a sequence of a desired length has been determined. 



47 



SUBSTITUTE SHEET (RULE 26) 



wo 98/44151 



PCT/GB98/00961 



(ii) in situ RNA Synthesis assay 



In this embodiment, DNA colonies can be used as templates for in situ 
RNA synthesis as depicted in figure 14. DNA colonies can be 
generated from templates and primers, such that a RNA polymerase 
promoter sequence is positioned at one end of the double-stranded DNA 
in the colony. DNA colonies can then be incubated with RNA polymerase 
and the newly synthesised RNA (cRNA) can be assayed as desired. The 
detection can be done non-specifically (e.g., staining) or in a 
sequence dependent way (e.g., hybridisation). 

The DNA template (I, figure 14a) to be amplified into a colony is 
generated by PGR reaction using primers (Ila and lib) which have the 
following four parts; 1) sequence identical to the sequences of the 
primers grafted onto the surface ('PI' and 'P2'). 2) a «tag" sequence 
which is different for each sample , a sequence corresponding to a 
RNA polymerase promoter, ie. the T3, T7 and SP6 RNA promoters, 
CRPP', figure 14a) and 4) primer sequences surrounding the specific 
sequence of interest ('SI' and 'S2' ) . Hereafter, we will designate 
the DNA fragments obtained after such a suitable process by the 
expression: "tagged RNA synthesis DNA" (III, figure 14b). 

After amplification of the DNA template from the original DNA sample, 
these templates are used to generate DNA colonies. The DNA colonies 
(IV, figure 14c) are then incubated with the RNA polymerase specific 
for the RNA polymerase promoter ( 'RPP' , figure 14c) . This will 
generate a copy of RNA specific for the DNA colony template 
(Template-cRNA, V, figure 14d) . 



cRNA thus synthesised can be isolated and used as hybridisation 
probes, as messenger RNA (mRNA) templates for in vitro protein 
synthesis or as templates for in situ RNA sequence analysis. 
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In another embodiment of the present invention, colonies can be 
analysed in order to determine sequences of nucleic acid molecules 
which form the colonies. Since very large numbers of the same nucleic 
acid molecules can be provided within each colony the reliability of 
the sequencing data obtained is likely to be very high. 

The sequences determined may be full or partial. Sequences can be 
determined for nucleic acids present in one or more colonies. A 
plurality of sequences may be determined at the same time. 

In some embodiments the sequence of a complementary strand to a 
nucleic acid strand to be sequenced (or of a part thereof) may be 
obtained initially. However this sequence can be converted using 
base-pairing rules to provide the desired sequence (or a part 
thereof) . This conversion can be done via a computer or via a 
person. It can be done after each step of primer extension or can be 
done at a later stage. 

Sequencing can be done by various methods . For example methods 
relying on sequential restriction endonuclease digestion and linker 
ligation can be used. One such method is disclosed in WO95/27080 for 
example. This method comprises the steps of: ligating a probe to an 
end of a polynucleotide, the probe having a nuclease recognition 
site; identifying one or more nucleotides at the end of the 
polynucleotide; and cleaving the polynucleotide with a nuclease 
recognising the nuclease recognition site of the probe such that the 
polynucleotide is shortened by one or more nucleotides. 

However in a preferred method of the present invention, amplified 
nucleic acid molecules (preferably in the form of colonies, as 
disclosed herein) are sequenced by allowing primers to hybridise with 
the nucleic acid molecules, extending the primers and detecting the 
nucleotides used in primer extension. Preferably, after extending a 
primer by a single nucleotide, the nucleotide is detected before a 
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further nucleotide is used in primer extension (step-by-step 
sequencing) . 

One or more of the nucleotides used in primer extension may be 
labelled. The use of labelled nucleotides during primer extension 
facilitates detection, (The term "label" is used in its broad sense 
to indicate any moiety that can be identified using an appropriate 
detection system. Preferably the label is not present in naturally- 
occurring nucleotides.) Ideally, labels are non-radioactive, such as 
f luorophores . However radioactive labels can be used. 

Where nucleotides are provided in labelled form the labels may be the 
same for different nucleotides. If the same label is used each 
nucleotide incorporation can be used to provide a cumulative increase 
of the same signal (e.g. of a signal detected at a particular 
wavelength) . Alternatively different labels may be used for each type 
of nucleotide (which may be detected at different wavelengths) . 

Thus four different labels may be provided for dATP, dTTP, dCTP and 
dGTP, or the same label may be provided for them all. Similarly, four 
different labels may be provided for ATP, UTP,CTP and GTP, or the 
same label may be provided for them all ) . In some embodiments of the 
present invention a mixture of labelled and unlabelled nucleotides 
may be provided, as will be described in greater detail later on. 

In a preferred embodiment of the present invention the sequencing of 
nucleic acid molecules present in at least 2 different colonies is 
performed simultaneously. More preferably, sequencing of nucleic acid 
molecules present in over 10, over 100, over 1000 or even over 
1,000,000 different colonies is performed simultaneously. Thus if 
colonies having different nucleic acids molecules are provided, many 
different sequences (full or partial) can be determined 
simultaneously - i.e. over 10, over 100, over 1000 or even over 
1,000,000 different sequences may be determined simultaneously. 

If desired, controls may be provided, whereby a plurality of 
colonies comprising the same nucleic acid molecules are provided. By 
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determining whether or not the same sequences are obtained for 
nucleic acid molecules in these colonies it can be ascertained 
whether or not the sequencing procedure is reliable. 

One sequencing method of the present invention is illustrated in 
figure 17, which is entitled "in situ sequencing". On prepared DNA 
colonies hybridised with an appropriate sequencing primer, cyclic 
addition of the individual deoxyribonucleoside triphosphates and DNA 
polymerase will allow the determination of the DNA sequence 
immediately 3' to the sequencing primer. In the exan^le outlined in 
figure 17, the addition of dGTP allows the determination of colony 1 
to contain a *G' . In the second cycle addition of dATP is detected in 
both colonies, determining that both colonies have an 'A' in the next 
position. After several repetitions of the addition of single 
deoxyribonucleoside triphosphates, it will be possible to determine 
any sequence. For example sequences of at least 10, at least 20, at 
least 50 or at least 100 bases may be determined. 



If colonies are provided initially in a form comprising double- 
stranded molecules the colonies can be processed to provide single- 
stranded molecules for use in sequencing as described above. (It 
should however be noted that double stranded molecules can be used 
for sequencing without such processing. For example a double stranded 
DNA molecule can be provided with a promoter sequence and step-by- 
step sequencing can then be performed using an RNA polymerase and 
labelled ribonucleotides (of Figure 16 d) ) . Another alternative is 
for a nick to be introduced in a double stranded DNA molecule so that 
nick translation can be performed using labelled 

deoxyribonucleotides. and a DNA polymerase with 5' to 3' exonuclease 
activity. ) 

One way of processing double-stranded molecules present in colonies 
to provide single-stranded colonies as described later with reference 
to Figure 19. Here double-stranded immobilised molecules present in a 
colony (which may be in the form of bridge-like structures) are 
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cleaved and this is followed by a denaturing step. (Alternatively a 
denaturing step could be used initially and could be followed by a 
cleavage step) . Preferably cleavage is carried out enzymatically . 
However other means of cleavage are possible, such as chemical 
cleavage. (An appropriate cleavage site can be provided in said 
molecule). Denaturing can be performed by any suitable means. For 
example it may be performed by heating and / or by changing the ionic 
strength of a medium in the vicinity of the nucleic acid molecules. 

Once single-stranded molecules to be sequenced are provided, suitable 
primers for primer extension can be hybridised thereto. 
Oligonucleotides are preferred as primers. These are nucleic acid 
molecules that are typically 6 to 60, e.g. 15 to 25 nucleotides long. 
They may comprise naturally and / or non-naturally occurring 
nucleotides. (However other molecules, e.g. longer nucleic acid 
strands may alternatively be used as primers, if desired.) The 
primers for use in sequencing preferably hybridise to the same 
sequences present in amplified nucleic acid molecules as do primers 
that were used to provide said amplified nucleic acids- (Primers 
having the same / similar sequences can be used for both 
amplification and sequencing purposes) . 

When primers are provided in solution and are annealed (hybridised) 
to nucleic acid molecules present in colonies to be sequenced, those 
primers which remain in solution or which do not anneal specifically 
can be removed after annealing. Preferred annealing conditions 
(temperature and buffer composition) prevent non-specific 
hybridisation. These may be stringent conditions. Such conditions 
would typically be annealing temperatures close to a primer's Tm 
(melting temperature) at a given salt concentration (e.g. 50 nM 
primer in 200 mM NaCl buffer at 55 for a 20-mer oligonucleotide 
with 50% GC content) . (Stringent conditions for a given system can be 
determined by a skilled person. They will depend on the base 
composition, GC content, the length of the primer used and the salt 
concentration. For a 20 base oligonucleotide of 50% GC, calculated 
average annealing temperature is 55-60'C, but in practice may vary 
between 35 to TCC) , 
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Primers used for primer extension need not be provided in solution, 
since they can be provided in immobilised form. In this embodiment 
the primers should be provided in the vicinity of the immobilised 
molecules to which they are to be annealed. (Such primers may indeed 
already be present as excess immobilised primers that were not used 
in amplifying nucleic acid molecules during the formation of 
colonies . ) 

The nucleic acid molecules present in colonies to be sequenced will 
include a sequence that hybridises to the primers to be used in 
sequencing (preferably under "stringent" conditions) . This portion 
can be added to a given molecule prior to amplification (which 
molecule may have a totally / partially unknown sequence) using 
techniques known to those skilled in the art. For example it can be 
synthesised artificially and can be added to a given molecule using a 
ligase . 

Once a nucleic acid molecule annealed to a primer is provided, primer 
extension can be performed. RNA or DNA polymerases can be used. DNA 
polymerases are however the enzymes of choice for preferred 
embodiments. Several of these are commercially available. 
Polymerases which lack 3' to 5* exonuclease activity can be used, 
such as T7 DNA polymerase or the small (Kienow) fragment of DNA 
polymerase I may be used [e.g. the modified T7 DNA polymerase 
Sequenase* 2.0 (Amersham) or Kienow fragment (3' to 5' exo-, New 
England Biolabs)]. However it is not essential to use such 
polymerases. Indeed, where it is desired that the polymerases have 
proof-reading activity polymerases lacking 3' to 5* exonuclease 
activity would not be used. Certain applications may require the use 
of thermostable polymerases such as ThermoSequenase™ (Amersham) or 
Taquenase-« (ScienTech, St Louis, MO) . Any nucleotides may be used for 
primer extension reactions (whether naturally occurring or non- 
naturally occurring). Preferred nucleotides are 

deoxyribonucleotides^ dATP, dTTP, dGTP and dCTP (although for some 
applications the dTTP analogue dUTP is preferred) or ribonucleotides 
ATP, UTP, GTP and CTP; at least some of which are provided in 
labelled form. 
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A washing step is preferably incorporated after each primer extension 
step in order to remove unincorporated nucleotides that may interfere 
with subsequent steps. The preferred washing solution should be 
compatible with polymerase activity and have a salt concentration 
that does not interfere with the annealing of primer molecules to the 
nucleic acid molecules to be sequenced. (In less preferred 
embodiments, the washing solution may interfere with polymerase 
activity. Here the washing solution would need to be removed before 
further primer extension.) 

Considering that many copies of molecules to be sequenced can be 
provided in a given colony, a combination of labelled and non- 
labelled nucleotides can be used. In this case, even if a small 
proportion of the nucleotides are labelled (e.g. fluorescence- 
labelled) , the number of labels incorporated in each colony during 
primer extension can be sufficient to be detected by a detection 
device. For example the ratio of labelled to non-labelled nucleotides 
may be chosen so that, on average, labelled nucleotides are used in 
primer extension less than 50%, less than 20%, less than 10% or even 
less than 1% of the time (i.e. on average in a given primer extension 
step a nucleotide is incorporated in labelled form in less than 50%, 
less than 20%, less than 10 %, or less than 1% of the extended 
primers. ) 

Thus in a further embodiment of the present invention there is 
provided a method for sequencing nucleic acid molecules present in a 
colony of the present invention, the method comprising the steps of: 

a) providing at least one colony comprising a plurality of single 
stranded nucleic acid molecules that have the same sequences as 
one another and that are hybridised to primers in a manner to 
allow primer extension in the presence of nucleotides and a 
nucleic acid polymerase; 

b) providing said at least one colony with a nucleic acid 
polymerase and a given nucleotide in labelled and unlabelled 
form under conditions that allow extension of the primers if a 
complementary base or if a plurality of such bases is present 
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at the appropriate position in the single stranded nucleic acid 
molecules present in said at least one colony; 

c) detecting whether or not said labelled nucleotide has been used 
for primer extension by determining whether or not the label 
present on said nucleotide has been incorporated into extended 
primers; 

Steps b) and c) may be repeated one or more times. Preferably a 
plurality of different colonies are provided and several different 
sequences are determined simultaneously. 

This further embodiment of the present invention can be used to 
reduce costs, since relatively few labelled nucleotides are needed. 
It can also be used to reduce quenching effects. 

It is however also possible to use only labelled nucleotides for 
primer extension or to use a major portion thereof (eg. over 50 %, 
over 70% or over 90% of the nucleotides used may be labelled) , This 
can be done for example if labels are selected so as to prevent or 
reduce quenching effects. Alternatively labels may be removed or 
neutralised at various stages should quenching effects become 
problematic (e.g. laser bleaching of fluorophores may be performed) . 
However this can increase the number of steps required and it is 
therefore preferred that labels are not removed (or at least that 
they are not removed after each nucleotide has been incorporated but 
are only removed periodically) , In other less preferred embodiments/ 
the primer itself and its extension product may be removed and 
replaced with another primer. If required, several steps of 
sequential label-free nucleotide additions may be performed before 
actual sequencing in the presence of labelled nucleotides is resumed. 
A further alternative is to use a different type of label from that 
used initially (e.g. by switching from fluorescein to rhodamine) 
should quenching effects become problematic. 

In preferred embodiments of the present invention a plurality of 
labelled bases are incorporated into an extended primer during 
sequencing. This is advantageous in that it can speed up the 
sequencing procedure relative to methods in which, once a labelled 
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base has been incorporated into an extended primer, the label must be 
removed before a further labelled base can be incorporated. (The 
plurality of labelled bases may be in the form of one or more 
contiguous stretches, although this is not essential.) 

The present invention therefore also includes within its scope a 
method for sequencing nucleic acid molecules, comprising the steps 
of: 

a) using a first colony to provide a plurality of single stranded 
nucleic acid molecules that have the same sequences as one 
another and that are hybridised to primers in a manner to allow 
primer extension in the presence of nucleotides and a nucleic 
acid polymerase; 

b) using a second colony to provide a plurality of single stranded 
nucleic acid molecules that have the same sequences as one 
another, and that are also hybridised to primers in a manner to 
allow primer extension in the presence of nucleotides and a 
nucleic acid polymerase; 

c) providing each colony with a nucleic acid polymerase and 
a given labelled nucleotide under conditions that allow 
extension of the primers if a complementary base or if a 
plurality of such bases is present at the appropriate position 
in the single stranded nucleic acid molecules; 

d) detecting whether or not said labelled nucleotide has been used 
for primer extension at each colony by determining whether or 
not the label present on said nucleotide has been incorporated 
into extended primers; 

e) repeating steps c) and d) one or more times so that extended 
primers comprising a plurality of labels are provided. 

Preferably the sequences of the nucleic acid molecules present at 
said first and said locations are different from one another - i.e. a 
plurality of colonies comprising different nucleic acid molecules are 
sequenced 
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In view of the foregoing description it will be appreciated that a 
large number of different sequencing methods using colonies of the 
present invention can be used. Various detection systems can be used 
to detect labels used in sequencing in these methods (although in 
certain embodiments detection may be possible simply by eye, so that 
no detection system is needed) . A preferred detection system for 
fluorescent labels is a Charge-Coupled-Device (CCD) camera, which can 
optionally be coupled to a magnifying device. Any other device 
allowing detection and, preferably, also quantification of 
fluorescence on a surface may be used. Devices such as fluorescent 
imagers or confocal microscopes may be chosen. 

In less preferred embodiments, the labels may be radioactive and a 
radioactivity detection device would then be required. Ideally such 
devices would be real-time radioactivity imaging systems. Also less 
preferred are other devices relying on phosphor screens (Moleculal 
Dynamics) or autoradiography films for detection. 

Depending on the number of colonies to be monitored, a scanning 
system may be preferred for data collection. (Although an 
alternative is to provide a plurality of detectors to enable all 
colonies to be covered.) Such a system allows a detector to move 
relative to a plurality of colonies to be analysed. This is useful 
when all the colonies providing signals are not within the field of 
view of a detector. The detector may be maintained in a fixed 
position and colonies to be analysed may be moved into the field of 
view of the detector (e.g. by means of a movable platform). 
Alternatively the colonies may be maintained in fixed position and 
the detection device may be moved to bring them into its field of 



view. 



The detection system is preferably used in combination with an 
analysis system in order to determine the number (and preferably also 
the nature) of bases incorporated by primer extension at each colony 
after each step. This analysis may be performed immediately after 
each step or later on, using recorded data. The sequence of nucleic 
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acid molecules present within a given colony can then be deduced from 
the number and type of nucleotides added after each step. 

Preferably the detection system is part of an apparatus comprising 
other components. The present invention includes an apparatus 
comprising a plurality of labelled nucleotides, a nucleic acid 
polymerase and detection means for detecting labelled nucleotides 
when incorporated into a nucleic acid molecule by primer extension, 
the detection means being adapted to distinguish between signals 
provided by labelled nucleotides incorporated at different colonies. 

The apparatus may also include temperature control, solvent delivery 
and washing means. It may be automated. 

Methods and apparatuses within the scope of the present invention can 
be used in the sequencing of: 

• unidentified nucleic acid molecules (i.e. de novo sequencing); 

• and nucleic acid molecules which are to be sequenced to check if 
one or more differences relative to a known sequence are present 
(e.g. identification of polymorphisms) . This is sometimes 
referred to as "re-sequencing". 

Both de novo sequencing and re-sequencing are discussed in greater 
detail later on (see the following sections (v) and (vi) ) . 

For de novo sequencing applications, the order of nucleotides applied 
to a given location can be chosen as desired. For example one may 
choose the sequential addition of nucleotides dATP, dTTP, dGTP, dCTP; 
dATP, dTTP, dGTP, dCTP; and so on. (Generally a single order of four 
nucleotides would be repeated, although this is not essential.) For 
re-sequencing applications, the order of nucleotides to be added at 
each step is preferably chosen according to a known sequence. 

Re-sequencing may be of particular interest for the analysis of a 
large number of similar template molecules in order to detect and 
identify sequence differences (e.g. for the analysis of recombinant 
plasmids in candidate clones after site directed mutagenesis or more 
importantly, for polymorphism screening in a population) . Differences 
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from a given sequence can be detected by the lack of incorporation of 
one or more nucleotides present in the given sequence at particular 
stages of primer extension. In contrast to most commonly used 
techniques, the present method allows for detection of any type of 
mutation such as point mutations, insertions or deletions. 
Furthermore, not only known existing mutations, but also previously 
unidentified mutations can be characterised by the provision of 
sequence information. 

In some embodiments of the present invention long nucleic acid 
molecules may have to be sequenced by several sequencing reactions, 
each one allowing for determination of part of the complete sequence. 
These reactions may be carried out at different colonies (where the 
different colonies are each provided with the same nucleic acid 
molecules to be sequenced but different primers), or in successive 
cycles applied at the same colony (where between each cycles the 
primers and extension products are washed off and replaced by 
different primers) . 



(iv) DHA Fingerprinting 



This embodiment of the present invention aims to solve the problem of 
screening a large population for the identification of given features 
of given genes, such as the detection of single nucleotide 
polymorphisms . 

In one preferred embodiment, it consists in generating tagged genomic 
DNA (see section G(ii) supra). (Thus each sample originating from a 
given individual sample has been labelled with a unique tag) . This 
tagged DNA can be used for generating primary colonies on an 
appropriate surface comprising immobilised primers. Several 
successive probe hybridisation assays to the colonies can then be 
performed. Between each assay the preceding probe can be removed, 
e.g. by thermal denaturation and washing. 
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Advantages of this embodiment of the present invention over other 
approaches for solving this problem are illustrated in the following 
example of a potential practical application. 



It is intended to detect which part of a gene (of, e.g., 2000 bases 
in size), if any, is related to a disease phenotype in a population 
of typically 1,000 to 10,000 individuals. For each individual, a PGR 
amplification can be performed to specifically amplify the gene of 
interest and to link a tag and a colony generating primer (refer to 
section G(iv), preparation of "tagged DNA") . 

In order to obtain a representative array of sample, one might want 
to array randomly 500,000 colonies (i.e. 10 times redundancy, so to 
have only a small probability of missing the detection of a sample) . 
With a colony density of 10,000 colonies per mm^ a surface of -7 mm x 
7 mm can be used. This is a much smaller surface than any other 
technology available at present time (e.g. The HySeq approach uses 
220 mm X 220 mm for the same number of samples (50,000) without 
redundancy) . The amount of reactants (a great part of the cost) will 
be proportional to the surface occupied by of the array of samples. 
Thus the present invention can provide an 800 fold improvement over 
the presently available technology. 



Using an apparatus to monitor the result of the 'in situ' sequencing 
or probe hybridisation assays, it should taJce on the order of 1 to 10 
seconds to image a fluorescent signal from colonies assayed using 
fluorescence present on a surface of -1 mm^ Thus, assuming that the 
bottlenecJc of the method is the time required to image the result of 
the assay, it talces of the order of 10 minutes to image the result of 
an assay on 50,000 samples (500,000 colonies). To provide 200 assays 
including imaging (on one or several 7 mm x 7 mm surfaces), using the 
present invention can take less than 36 hours. This represents a 20 
times improvement compared to the best method known at present time 
{HySeq claims 30 days to achieve a comparable task) . 
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Improvements (colony densities 10 times higher and imaging time of 1 
second) could allow for much higher throughput and finally the 
ultimately expected throughput could be about 2000 times faster than 
the best, not yet fully demonstrated, technology available at present 

time . 

Another advantage of using the present invention lies in the fact 
that it overcomes the problem arising with individuals who have 
heterozygous mutations for a given gene. While this problem may be 
addressed by existing sequencing methods to determine allelic 
polymorphisms, current high throughput mutation detection methods 
based on oligonucleotide probe hybridisation may lead to difficulties 
in the interpretation of results due to an unequal hybridisation of 
probes in cases of allelic polymorphisms and therefore errors can 
occur. In this embodiment of the present invention, each colony 
arises from a single copy of an amplified gene of interest. If an 
average of 10 colonies are generated for each individual locus, there 
will be an average of 5 colonies corresponding to one version of a 
gene and 5 colonies corresponding to the other version of the gene. 
Thus heterozygotic mutations can be scored by the number of times a 
single allele is detected per individual genome sample. 



(v) ' DNA xesequenoing 

This embodiment of the present invention provides a solution to the 
problem of identifying and characterising novel allelic polymorphisms 
within known genes in a large population of biological samples. 

in its preferred embodiment it consists in obtaining tagged DNA (each 
sample originating from a given individual has been tagged with a 
unique tag - see section G(iv)) . This encoded DNA can then be used 
for generating primary colonies on an appropriate surface comprising 
immobilised primers. Several successive assays of probe 
hybridisation to the colonies can then be performed wherein between 
each cyclic assay the preceding probe can be removed by thermal 
denaturation and washing. Preferably, the DNA sequence 3' to a 
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specific probe may be determined directly by 'in situ sequencing' 
(section I(iii), Methods of sequencing). 

The advantages of the present invention over other approaches for 
solving this problem are illustrated in the following example of 
potential practical application: 

It is desired to identify the variability of the sequence of a gene 
(of, e.g., 2 000 bases in size), if any, in a population of typically 
4 000 individuals. It is assumed t.^at a reference sequence of the 
gene is Icnown. For each individual, a PCR amplification can be 
performed to specifically amplify the gene of interest and link a tag 
and a colony generating primer. In order to obtain a representative 
array of sample, one might want to array randomly 40 000 colonies 
(i e. 10 times redundancy, so to have a small probability of missing 
the detection of a sample) . With a colony density of 10 000 colonies 
per mmS a surface of -2 mm x 2 mm can be used. 

using an apparatus with a CCD camera (having a 2000 x 2000 pixel 
chip) to monitor the result of the assay, it should talce of the order 
of 10 seconds to image a fluorescent signal from colonies on a 
surface of 4 mm^ If it is assumed possible to read at least 20 bases 
during one round of the assay, this requires 61 imaging steps (3n+l 
imaging steps are necessary for reading n numbers of bases) . If it 
is assumed that the bottleneOc of the method is the time to image the 
result of the assay, it talces of the order of 15 minutes to image the 
result of an assay on 4 000 samples (40 000 colonies) . To realise 
100 assays (on one or several 2x2mm^ surfaces) in order to cover the 
entire gene of interest, the present invention can allow the whole 
screening experiment to be performed in approximately one day. with 
one apparatus. This can be compared to the most powerful systems 
operational at the present time. 

In this embodiment of the present invention with conservative 
assumptions (colony density, imaging time, size of the CCD chip) , a 
throughput of 3.2x lO' bases per hour could be reached, i.e. a 400 
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fold improvement when compared to the most commonly used system at 
present time (current DNA sequencers have a typical throughput of the 
order of 8,000 bases read/hour). 



(vi) da novo DHA sequencing 

This embodiment of the present invention aims to solve the problem of 
sequencing novel genomes (or parts thereof) with low cost and in 
short time, where the sequence of the DNA is not known. Genomic DNA 
can be prepared, either directly from the total DNA of an organism of 
interest or from a vector into which DNA has been inserted. The 
prepared genomic DNA (from whatever source) can be used to generate 
DNA colonies. The DNA colonies can then be digested with a rare- 
cutting restriction enzyme, whose site is included in the linker, 
denatured and sequenced. 



Figure 16 depicts an example of de novo DNA sequencing. In this 
example, genomic DNA is fragmented into pieces of 100 to 2000 base 
pairs (see preparation of random DNA fragments, section G{i)). These 
fragments will be ligated to oligonucleotide linkers (Ila and lib, 
figure 16a) which include sequences specific for the grafted primers 
on the surface ('PI' and 'P2'), a sequence which is recognised by a 
rare-cutting restriction nuclease ( 'RE' ) and a sequence corresponding 
to a sequencing primer ( 'SP' ) , resulting in templates (III, figure 
16b) . Using t;his prepared DNA as template for DNA colony formation, 
one obtains primary colonies (IV, figure 16c) . These colonies are 
then digested with the corresponding restriction endonuclease and 
denatured to remove the non-attached DNA strand (V, figure 16d) . The 
sequencing primer (SP) is then annealed to the attached single- 
stranded template (figure 16e) . Incorporation and detection of 
labelled nucleotides can then be carried out as previously described 
(see section I(iii), Methods of Sequencing). 



In this embodiment, the throughput obtainable can be at least 400 
times higher than presently available methods. 
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(vii) mRNA gene expression monitoring 

This embodiment of our invention means to solve the problem of 
monitoring the expression of a large number of genes simultaneously. 

Its preferred embodiment is depicted in figure 17. 

Firstly, primary colonies are prepared, as depicted in figure 3. In 
its preferred form, the DNA used for this preparation is ^prepared 
genomic DNA' or ^tagged genomic DNA', as described in section G(i) 
and G{iii), respectively, and where the DNA is either from the whole 
genome of one (or several) organism(s) or from a subset thereof 
(e.g., from a library of previously isolated genes). In figure 17, 
the uppercase letters, ^^A", ^^B" and ^^D" represent colonies which have 
arisen from genes which exhibit high, medium and low expression 
levels, respectively, and "E" represents colonies arising from non- 
expressed genes (in real cases, all these situations may not 
necessarily be present simultaneously) . 

Secondly, the colonies are treated to turn then into supports (i.e. 
secondary primers) for secondary colony growing (step i in figure 
17a), as described in section E. At this stage (figure 17b), the 
treated colonies are represented by underlined characters (A, B, D, 
or E) . 

Thirdly, (step ii in figure 17b) this support for secondary colony 
growing is used to regenerate colonies from mRNA (or cDNA) templates 
extracted from a biological sample, as described in section C. If 
the template is mRNA, the priming step of colony regeneration will be 
performed with a reverse transcriptase. After a given number of 
colony amplification cycles, preferably 1 to 50, the situation will 
be as depicted in (figure 17c): the colonies corresponding to highly 
expressed genes (represented by the letter -A") are totally 
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regenerated, as their regeneration has been initiated by .any copxes 
71 .RHA; the colonies corresponding to genes of .ediu. expression 
levels (represented by the letters »b" and »B") , have been on y 
partially regenerated; only a few of the colon.es correspond ng to 
Tare .enea (represented by the letter M"), have been partially 
regenerated; the colonies corresponding to non-expressed sequences 
(presented by the letter »E'')^ regenerated at all. 

.astly, (Step iii in figure 17c), additional cycles of colony growing 
"e p f r^ed (preferably 2 to SO), and the colonies whxch have not 
Ten totally regenerated during the previous steps finally become 
To lly regenerated, "b" beco.es ^«B", M» beco.es «D« (figure 17d) : 
the coLnies corresponding to genes with high and .ediu. expression 
levels are all regenerated "A" and "B" or «B"; the colonies 
corresponding to genes with low levels of expression are not all 
regenerated "D" and «D"; the colonies corresponding to non-expressed 
sequences are not regenerated at all 

The relative levels of expression of the genes can be obtained by the 

following preferred methods: 

- Firstly levels of expression can be monitored by following the rat^ 
of regeneration of the colonies (i.e., by measuring the a.oun of ONA 
inside a colony after different number of colony growing cycles 
during step (iii)) as the rate at which a colony .s 
b liLed the nu^er of ^A (or cO.A) ^^^^^^^r^^"::^:; 
regeneration of that colony (at first approxi««t.cn, the number of 
OHl Llecules after n cycles, noted M(n), in a colony ^-^^J^^^'^ 
"generation should be given by M(n, = M^r'-., where M, .s the nu^r 
of .olecules which initiated the regeneration of the colony, r .s the 
growing rate and n is the nuinber of cycles); 

- secondly, levels of expression can be monitored by — 
each gene, the number of colonies which have been regenerated and 
comparing this nunO^er to the total nun^er of colon.es corresponding 
to that gene. These measurements will generally give access to the 
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relative expression levels of the genes represented by the colonies. 
The identification of the colonies is preferably performed by 
fingerprinting, in a manner essentially similar to embodiment, 
section I(iv). Note that encoding the DMA samples is not required, 
but can be considered as an alternative to the direct identification 
of the DNA in the colonies. This can be of practical interest 
because with coding, the same codes (thus the same oligonucleotides 
involved in assaying the code) can be used for any set of genes, 
whereas without code, a different set of specific oligonucleotides 
has to be used for each set of genes. 

This embodiment of our invention has many advantages if compared to 
current state of the art including: a very high throughput; no 
requirement for prior amplification of the mRNA (even though prior 
a„»plification is compatible with or invention); small amounts of 
samples and reactants are required due to the high density of samples 
with our invention; the presence of highly expressed genes has no 
incidence on the ability to monitor genes with low levels of 
expression; the ability to simultaneously monitor low and high levels 
of expression within the set of genes of interest. 

When the initial DNA in the generation of the primary DNA colony is 
„ade from the DNA of a whole genome, this embodiment also provides 
the following features, there is no interference between genes 
expressed at high level and at low level even though one has not 
performed specific amplification of the genes of interest This is a 
unique feature of the use of this invention: specific amplification 
is not possible because the initial assumption of this embodiment is 
to monitor the expression genes which may have not yet been isolated, 
thus which are unlcnown, and thus for which no specific (unique) 
sequences are known and which specific sequences would have been 
necessary for specific gene amplification. The ability of our 
invention to perform this type of mRNA expression monitoring is due 
to the fact that when the primary colonies are prepared, 
statistically, each piece of the initial genome will be -P-sented 
by the same number of colonies. Thus, frequent and rare DNA will 
initiate the same number of colonies (e.g.. one colony per added 
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gence .olecule, . Quantitative infcnnation n.ight be <'''^^J-'* ^^^^ 
from frequent and rare mRNAs by monitoring the growing rate of the 

colonies . 

(viii) isolation and characterisation of novel expressed genes 

This embodiment of our invention means to solve the problem of 
isolating the genes which are specifically induced under given 
conditions, e.g., in specific tissues, different straxns of a g.ven 
species or under specific activation. A practical example xs the 
identification of genes which are up or down regulated after drug 
administration. 

The preferred embodiment for isolating genes from a specific or 
activated biological sample (hereafter called target sample) wh.ch 
are up-regulated compared with a reference biological sample 
(hereafter called reference sample) is depicted in figure 18. 

rirstly, primary colonies are prepared (figure 18a, . In its preferred 
form, the DHA used for this preparation is prepared genomxc PNA or 
tagged genomic DNA, as described in sections G(i) and G(xx). 
respectively, where the om is either from the whole genome of one 
(or several) organism(s, or from a subset thereof (eg, from a 
ibrary of previously isolated genes,, and where both the.prxmers 
used for colony generation (hereafter called PI and P2, -taxn a 
endonuclease restriction site. In figure 18a, "A" represents co^ .es 
Which have arisen from genes expressed in both the reference sample 
and the target sample, "B" represents colonies which have arxsen from 
genes Lpressed only in the reference sample, "C" represents colonxes 
Ich have arisen from genes expressed only in the -r get sample and 
-D" represents colonies arising from non-expressed genes (xn real 
cases, all these situations may not necessarily be present 
simultaneously) . 
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secondly, primary colonies are then treated to generate secondary 
primers as the support for secondary colony growing (step i in figure 
18a). At this stage (b) , the colonies are represented as underlined 
characters (A,B,C,D) . 



Thirdly, (step ii in figure 21b) the secondary primers are used to 
regenerate colonies using mRNA or cDNA (represented by «mA + mB") 
extracted from the biological reference sample as a template, as 
described in G(v). If the template is mBNA, the first elongation step 
of colony regeneration will be performed with a reverse 
transcriptase. After enough colony growing cycles, preferably 5 to 
100, only the colonies corresponding to genes expressed in the 
reference sample ("A" and "B") will be regenerated, as depicted in 
(figure IBc) . 

in step (iii). the colonies are digested with a restriction enzyme 
(represented by RE) which recognises a site in the flanking prxmer 
sequences. PI and P2, which are grafted on the support and which were 
the basis of primary colony generation. Importantly, only the 
colonies which have been regenerated during step (ii) will be 
digested. This is because the support for secondary colony growth is 
made of single stranded DNA molecules, which can not be digested by 
the restriction enzyme. Only the regenerated colonies are present xn 
a double stranded form, and are digested. After digestion, the 
situation is the one depicted in figure 18d. The colonies 
corresponding to the genes expressed in the reference sample have 
totally disappeared, i.e., they are not even present as a support for 
secondary colony growth, and the colonies corresponding to genes 
expressed only in the target sample «C" and the colonies 
corresponding to non-expressed genes "D" are still present as a 
support for secondary colony generation. 

in step (iv), mRNA (or cDNA) (represented by "mA + mC") extracted 
from the target sample is used to generate secondary colonies. 
Because colonies corresponding to mA and mB no longer exist, only the 
colonies corresponding to mC can be regenerated (ie. Only the mRNA 
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specifically expressed in the target sample,. After sufficient number 
Z coLy . owing cycles (preferably S to 100). the situation .s ch 
It lnly the colonies corresponding to genes expressed specifically 
in the target sample are regenerated ("C", figure 18e) . 

I, step (V), the regenerated colonies "C" are used to generate copies 

/oL ;hat they contain by performing several JP" 
,0) colony growing cycles in the presence of the prx.er PI and P2, 
as described in section D of the present invention. A PCR 
amplification is then performed using PI and P2 .n solution 
described in section D) and the amplified DHA characterised by 
classical methods. 

The preferred embodiment for isolating genes from ^ ^P^^^J ^ 
activated biological sample which are less expressed than xn a 
Tef rence biological sample is depicted in figure 19. The different 
p involved in this procedure are very similar to those involved 
in the isolation of gene which are more regulated than in t e 
ference sample, and the notation are the same as in figur 8. 

• o i-o inverse the order used to regenerate the 

the ^ i' th. .n. extracted tKe » '""'^ ^ ^'^^ 

1 »« * ^ „,ere„ce =«.le 

corresponding to genes wnicn <* t- -..^.^^ fioure 

but not in the target sample is recovered and amplified (. B , figure 



19f) 
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Clai«s 



1. 

of: 



method of nucleic acid amplification, comprising the steps 



A. providing a plurality of primers that are inunobilised but that 
have one end exposed to allow primer extension; 

B. allowing a single stranded target nucleic acid molecule to 
anneal to one of said plurality of primers over part of the 
length of said single stranded nucleic acid molecule and then 
extending that primer using the annealed single stranded 
nucleic acid molecule as a template, so as to provide an 
extended immobilised nucleic acid strand; 

C. separating the target nucleic molecule from the extended 
immobilised nucleic acid strand; 

D allowing the extended immobilised nucleic acid strand to anneal 
' to one of said plurality of primers referred to in step A) and 
then extending that primer using the extended imn«bilised 
nucleic acid strand as a template, so as to provide another 
extended immobilised nucleic acid strand; and optionally, 

E. separating the annealed extended immobilised nucleic acid 
strands from one another. 

A method according to claim 1, further comprising: 

F using at least one extended immobilised nucleic acid strand to 
repeat steps D) and E), so as to provide additional extended 
immobilised nucleic acid strands and, optionally, 

G. repeating step F) one or more times. 

3 A method according to claim 1 or claim 2, wherein said single- 
stranded target nucleic acid is produced by providing a given nuclexc 
acid sequence to be amplified (which sequence may be known or 
unknown) and adding thereto a first nucleic acid sequence and a 
second nucleic acid sequence; wherein said first nucleic acid 
sequence hybridises to one of said plurality of primers and said 
second nucleic acid sequence is complementary to a sequence which 
hybridises to one of said plurality of primers. 



2. 
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4 A method according to any preceding claim; wherein said s.ngle- 
siranded target nucleic acid is produced by providing a given nucle.c 
acid sequence to be amplified (which sequence may be known or 
unknown) and adding thereto a first nucleic acid sequence and a 
second nucleic acid sequence; wherein said first nucle.c ac.d 
sequence hybridises to one of said plurality of primers and saxd 
second nucleic acid sequence is the same as the sequence of one of 
said plurality of primers. 

5 A method according to claim 3 or claim 4 wherein said first and 
second nucleic acid sequences are provided at first and second ends 
of said single-stranded target nucleic acid. 

6 A method according to any of claims 3 to 5, wherein a tag is 
also added to the given nucleic acid sequence, said tag enablxng 
amplification products of the given nucleic acid sequence to be 
identified. 

7 A method according to any preceding claim wherein the plurality 
of primers is a plurality of primers that have the same sequence. 

8 A method according to any of claims 1 to 6, wherein the 
plurality of primers comprises at least two different types o 
primer, one type having a different sequence from another type. 

9 A method according to claim 8, wherein the plurality of primers 
consists of 2" different types of primer; wherein n is an integer. 

10. A method according to claim 8 or claim 9, wherein the different 
types Of primer are present in substantially the same concentrations 



as one another 
11 



IX A method according to any preceding claim, wherein the primers 
are substantially homogeneously dispersed over a given area 

12. A method according to any preceding claim, wherein the primers 
are located in a predetermined arrangement (e.g. in a grid pattern, 
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13. A method according to any preceding claim, wherein a supply of 
nucleotides and a nucleic acid polymerase are used to extend primers 

14. A method according to any preceding claim, wherein heating is 
used to separate annealed nucleic acid strands 

15. A method according to claim 14 when dependent upon claim 13, 
wherein the nucleic acid polymerase is not rendered inactive by the 
heating conditions used to separate annealed nucleic acid strands. 

16. A method according to claim 15, wherein said nucleic acid 
polymerase is tag polymerase, is another polymerase that is derivable 
from a thermophilic organism; or is a thermostable derivative 
thereof. 

17 . A method according to any preceding claim, wherein said primer 
extension results in the incorporation of one or more detectable 
labels (e.g. fluorescent labels or radiolabels) into extended 
immobilised nucleic acid strands. 

18. A method according to any preceding claim, further including 
the step of treating one or more extended immobilised nucleic acid 
strands so as to release a nucleic acid molecule or a part thereof. 

19. A method according to claim 18, wherein said treating consists 
of cleavage with a restriction endonuclease or with a ribozyme. 

20. A method according to any preceding claim, wherein one or more 
of said primers has a restriction endonuclease recognition site or a 
riiozyme recognition site or has part of such a site, which part 
becomes complete when primer extension occurs. 

21. A method according to any preceding claim that is automated to 
allow repeated cycles of nucleic acid amplification. 

22. A method according to any preceding claim, when used to amplify 
a plurality of different nucleic acid sequences 
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23. A method according to claim 22, when used to amplify a 
plurality of different nucleic acid sequences simultaneously. 

24. A method according to claim 22 or 23, wherein said different 
nucleic acid sequences are each provided with a first and second 
nucleic acid sequence as described in any of claims 3 to 5, said 
first and second nucleic acid sequences being the same for the each 
of the different nucleic acid sequences. 

25. A method according to any of claims 22 to 24, wherein said 
different nucleic acid sequences are each provided with a different 
tag so that the different sequences can be distinguished from one 
another. 

26. A plurality of immobilised nucleic acids producable by a 
method according to any preceding claim. 

27 A plurality of immobilised nucleic acids in the form of one or 
more distinct areas on a surface, each area comprising a plurality of 
identical nucleic acid strands and a plurality of identical 
complementary strands thereto; wherein each nucleic acid strand 
within such an area is located so that another nucleic acid strand is 
located on the surface within a distance of the length of that 
Strand. 

28 A plurality of immobilised nucleic acids according to claim 26 
or claim 27, wherein there is at least one distinct area present per 

of surface on which the nucleic acids are immobilised. 

29 A plurality of immobilised nucleic acids according to claim 27, 
wherein the number of distinct areas/mm^ of surface on which the ^ 
nucleic acids are immobilised is greater than 1, greater than 10 , 
greater than 10^ or greater than lOV 

30 The use of a method according to any of claims 1 to 25 or of a 
plurality of immobilised nucleic acid molecules according to any of 
claims 26 to 29 in providing nucleic acid molecules for sequencing. 
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31 The use according to claim 30, wherein sequencing is performed 
by extending primers hybridised to single stranded nucleic acid 
molecules and detecting the nucleotides used in primer extension. 

32 The use according to claim 31, wherein said primers hybridise 
to the same sequences as the primers used in the method of claim 1 to 
provide the immobilised nucleic acid molecules for sequencing. 

33 The use according to claim 31, wherein said primers have the 
same sequences as the primers used in the method of claim 1 to 
provide the immobilised nucleic acid molecules for sequencing. 

34 The use according to claim 30, wherein nicks are provided in 
double-stranded nucleic acid molecules, nick translation is performed 
and the nucleotides used in nick translation are detected. 

35 The use according to claim 30, wherein an RNA polymerase is 
used to synthesise an RNA strand from a DNA molecule and the 
nucleotides used to produce said RNA strand are detected. 

36 The use according to any of claims 31 to 35, wherein all or at 
least some of said nucleotides are labelled nucleotides. 

37. The use according to claim 36, wherein said labelled 
nucleotides all have the same label. 

38. The use according to claim 36 or claim 37, wherein said 
labelled nucleotides are fluorescence labelled 

39. The use according to any of claims 36 to 38, wherein a mixture 
of labelled and non-labelled nucleotides are used. 



The use according to claim 39. wherein the labelled nucleotides 
comprise less than 50 % of the nucleotides used. 



40 



41. The use according to claim 40, wherein the labelled nucleotides 
comprise less than 10 % of the nucleotides used. 
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42 The use according to any of claims 30 to 41, wherein the 
sequencing is parallel sequencing of nucleic acid molecules present 
in at least 2 different distinct areas. 

43 The use according to any of claims 30 to 41, wherein the 
sequencing is parallel sequencing of nucleic acid molecules present 
in over 10 different distinct areas. 

44 The use according to any of claims 30 to 41, wherein the 
sequencing is parallel sequencing of nucleic acid molecules present 
in over 100 different distinct areas. 

45 The use according to any of claims 30 to 41, wherein the 
sequencing is parallel sequencing of nucleic acid molecules present 
in over 1000 different distinct areas. 

46 The use according to any of claims 30 to 41, wherein the 
sequencing is parallel sequencing of nucleic acid molecules present 
in over 1000000 different distinct areas. 

47. The use according to any of claims 30 to 46, wherein a 
plurality of different sequences are determined in parallel. 

48 The use of a method according to any of claims 1 to 25 or of a 
plurality of immobilised nucleic acid molecules according to any of 
claims 26 to 29 in providing amplified nucleic acid molecules for 
diagnosis. 

4 9 The use of a method according to any of claims 1 to 25 or of a 
plurality of immobilised nucleic acid molecules according to any of 
claims 26 to 29 in providing amplified nucleic acid molecules for 

screening. 

50 The use of a method according to any of claims 1 to 25 or of a 
plurality of immobilised nucleic acid molecules according to any of 
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claims 26 to 29 in providing amplified nucleic acid molecules to be 
used as a support for other components. 

51 The use of a method according to any of claims 1 to 25 or of a 
plurality of immobilised nucleic acid molecules according to any of 
claims 26 to 29 in generating additional nucleic acid molecules in 
free (rather than immobilised) form. 

52. The use of a method according to claim 51 in in situ RNA 
synthesis 

53 The use of a method according to any of claims 1 to 25 or of a 
plurality of immobilised nucleic acid molecules according to any of 
claims 26 to 29 in monitoring gene expression 

54 The use of a method according to any of claims 1 to 25 or of a 
plurality of immobilised nucleic acid molecules according to any of 
claims 26 to 29 in identifying nucleic acid molecules with gene 
products that are rarely expressed. 

55 The use of a method according to any of claims 1 to 25 or of a 
plurality of immobilised nucleic acid molecules according to any of 
claims 26 to 2 9 in identifying heterozygous individuals 

56 The use of a method according to any of claims 1 to 25 or of a 
plurality of immobilised nucleic acid molecules according to any of 
claims 26 to 29 in nucleic acid fingerprinting 

57 An apparatus for performing a method as described in any of 
claims 1 to 25; comprising a plurality of immobilised primers, a 
nucleic acid polymerase, a plurality of nucleotides and means for 
separating annealed nucleic acid strands. 

58 An apparatus according to claim 57, wherein the means for 
separating annealed nucleic acid strands comprises a controlled 
heating means. 

59 An apparatus for analysing a plurality of nucleic acid 
„.olecules according to any of claims 26 to 29, wherein said apparatus 
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comprises a source of reactants and detector means for detecting one 
or more signals produced after said reactants have been applied to 
said nucleic acid molecules. 

60. An apparatus according to claim 59 wherein said detector means 
has sufficient resolution to distinguish between the distinct areas 
referred to in claim 26. 

61. An apparatus according to claim 59 or claim 60 comprising a 
charge coupled device (CCD) . 

62. An apparatus according claim 61 wherein said charge coupled 
device (CCD) is operatively connected with a magnifying device (e.g. 
a microscope) . 

63. A kit for use in screening, diagnosis or in nucleic acid 
sequencing; comprising a plurality of immobilised nucleic acid 
according to any of claims 26 to 29. 



64. 



The invention substantially as hereinbefore described. 
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FIG. Uq 

Colony generation on tubes f unct ionalised with 
ligonuiileotidG p57 (control) Results of visualisation of fluorescent 
icrospheres Hybridisation with probe detects 0.10 fmol/tube 




FIG. Ab 

Colony generation on tubes funct ionalised with 
oligonucleotide p58. Results of visualisation of fluorescent 
microspheres. Hybridisation with probe detects 6.2 fmol/tube 
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FIG. 6(a) 

DNA amplification - detection of colonies with DNA intercalating dyes 

ii) with template 




FIG. 6(b) 

DNA amplification, detection of colonies by specific probe hybndisati 
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FIG. 8-2 
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FIG. 15 

m situ Sequencing 
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